A Survey of Weighted Polynomial Approximation with Exponential 

Weights 

D. S. Lubinsky 
1 January 2007 

Abstract 

Let W : K — ► (0, 1] be continuous. Bernstein's approximation problem, posed in 1924, 
deals with approximation by polynomials in the weighted uniform norm / — > H/WHi^fR)- The 
qualitative form of this problem was solved by Achieser, Mergelyan, and Pollard, in the 1950's. 
Quantitative forms of the problem were actively investigated starting from the 1960's. We 
survey old and recent aspects of this topic, including the Bernstein problem, weighted Jackson 
and Bernstein Theorems, Markov-Bernstein and Nikolskii inequalities, orthogonal expansions 
and Lagrange interpolation. We present the main ideas used in many of the proofs, and different 
techniques of proof, though not the full proofs. The class of weights we consider is typically 
even, and supported on the whole real line, so we exclude Laguerre type weights on [0, oo). 
Nor do we discuss Saff's weighted approximation problem, nor the asymptotics of orthogonal 
polynomials. 
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1 Bernstein's Approximation Problem 

The first quarter of the twentieth century was a great period for approximation theory. In that time, 
Dunham Jackson and Sergei Bernstein completed their great works on the degree of approximation. 
Miintz proved his theorem on approximation by powers {x Xj }°°_ Q , solving a problem of Bernstein, 
Faber introduced Faber polynomials and Faber series, and s'zego was developing the theory of 
orthogonal polynomials on the unit circle. Right at the end of that quarter, in 1924, Bernstein 
[3] came up with a problem that became known as Bernstein's approximation problem, and whose 
ramifications continue to be explored to this day. 
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One can speculate that one day Bernstein must have felt confined in approximating on bounded 
intervals, and so asked: by Weierstrass, we know that we can uniformly approximate any continuous 
function on a compact interval by polynomials. Are there analogues on the whole real line? The 
first thing to deal with is the unboundedness of polynomials on unbounded intervals. Clearly we 
need to damp the growth of a polynomial at infinity, by multiplying by a weight. For example, 
consider 

P(x)exp(-x 2 ), 
where P is a polynomial, or more generally, 

P(x)W(x). 

Here W must decay sufficiently fast at ±00 to counteract the growth of every polynomial. That is, 

lim x n W(x) = 0, n = 0,1,2,... . (1.1) 

\x\^oo 

What can be approximated, and in what sense? This problem is known as Bernstein's ap- 
proximation problem. A more precise statement is as follows: let W : R — > [0, 1] be measurable. 
When is it true that for every continuous / : R — ► R with 

lim {fW){x) = 0, 

there exists a sequence of polynomials {P n }^ =1 with 

lim \\(f-P n )W\\ Loom = 07 

n— >oo v ' 

If true we then say that the polynomials are dense, or that Bernstein's problem has a positive 
solution. The restriction that fW has limit at ±00 is essential: if x k W(x) is bounded on the real 
line for every non-negative k, then x k W(x) has limit at ±00 for every such k, and so the same 
is true of every weighted polynomial PW. So we could not hope to approximate, in the uniform 
norm, any function / for which fW does not have limit at ±00. 

When W vanishes on a set of positive measure, or is not continuous, extra complications ensue. 
So in the sequel, we shall assume that W is both positive and continuous. The more general case 
is surveyed at length in [67]. The case where we approximate only on a countable set of points is 
included in that study. 

In his early works on the problem, Bernstein often assumed that 1/W is the restriction to the 
real line of an even entire function with positive (even order) Maclaurin series coefficients. Other 
major contributors were N. I. Akhiezer, K. I. Babenko, L. de Branges, L. Carleson, M. M. Dzrbasjan, 
T. Hall, S. Izumi, T. Kawata, S. N. Mergelyan, and V. S. Videnskii. Yes, that's Lennart Carleson, 
and the same Mergelyan of Mergelyan Theorem fame (on uniform approximation by polynomials 
on compact subsets of the plane). 

Bernstein's approximation problem was solved independently by Achieser, Mergelyan, and Pol- 
lard, in the 1950's. Mergelyan [67, p. 147] introduced a regularization of the weight 

U(z) = sup i|P(z)| : P a polynomial and sup - — - < 1 
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Theorem 1.1 (Mergelyan) Let W : R — ► (0,1] be continuous and satisfy (1.1). There is a 
positive answer to Bernstein's problem iff 

7T-dt = oo. 

1 + t 2 

In another formulation, there is a positive answer iff 

£l(z) = oo 

for at least one non-real z (and then £l(z) = oo for all non-real z). We shall outline the proof of 
this in the next section. 

Akhiezer (and perhaps Bernstein?) |67[ p. 158] used instead the regularization 

W*(z) = sup{|P(z)| : P a polynomial with H-PI^Hl^r) < 1} • 

Theorem 1.2 (Akhiezer) Let W : R — > (0, 1] be continuous and satisfy (1.1). There is a positive 
answer to Bernstein 's problem iff 

logW,(t) 

-at = oo. 



1 + t 2 

Finally, Pollard [161], [67l p. 164] showed: 

Theorem 1.3 (Pollard) Let TV : R — ► (0, 1] be continuous and satisfy (1.1). There is a positive 
answer to Bernstein's problem iff both 



f 

J — ( 



\og(l/W(t)) 

-at = oo 



1 + t 2 

and there exists a sequence of polynomials {P n } such that for each x, 

lim P n (x)W(x) = 1, 

n^oo 

while 

aup\\P n W\\ LoB Qn) < oo. 

n>l 

Pollard later 11621 reformulated this as 



r °° log|P(x)| 

sup <> / k — dx : P a polynomial with ||-PW|| < 1 \ = oo. 



— oo 



1 + X 2 



Of course, these are not very transparent criteria. When the weight is in some sense regular, 
simplifications are possible. 

Theorem 1.4 Let W be even, and log (1/W(e x )) be convex. There is a positive answer to Bern- 
stein 's problem iff 

jgfl/gp* = oo. (1.2) 
1 + x z 



Weighted Approximation 



5 



This result was proved by Lennart Carleson in 1951 [TO], although Misha Sodin pointed out 
to the author that it appeared in a 1937 paper of Izumi and Kawata [51]. On perusing the latter 
paper, I agree that it is clearly implicit in the results there, though not explicitly stated. I have 
also seen the result attributed to M. Dzrbasjan. The reader should conclude that the "history" 
presented in this survey is by no means authoritative, but merely what could be deduced (often 
from secondary sources) in the available time. 

Corollary 1.5 Let a > and 

W a (x) = exp(-|x| a ). (1.3) 
There is a positive answer to Bernstein's problem iff a > 1. 

As regards necessary conditions, Hall showed that (1.2) is necessary for density. When density 
fails, only a limited class of entire functions can be approximated [70] . A comprehensive treatment 
of this topic is given in Koosis' book [67] . A concise elegant exposition appears in [86\ p. 28 ff.]. A 
more abstract solution to Bernstein's problem was given by Louis de Branges (of Bieberbach fame) 
in 1959 [29]. The ideas in that paper have had several recent ramifications [174] , [175]. 

What about extensions to L p ? At least when W is continuous, the answer is the same: 

Theorem 1.6 Let W : R — > (0, 1] be continuous and satisfy (1.1). Let p > 1. Then any of the 
conditions of Achieser, Mergelyan, or Pollard is necessary and sufficient so that for every measurable 
f : R — ► R with \\fW\\ Lp ^ < oo, there exist polynomials {P n } with 

lim\\(f-P n )W\\ Lp{R) =0. 

There is even a generalization of this theorem to weighted Miintz polynomials [188]. As regards 
the proof of the L p case, let us quote Koosis |67} p. 211]: "In general, in the kind of approximation 
problem considered here, (that of the density of a certain simple class of functions in the whole 
space), it makes very little difference which L p norm is chosen. If the proofs vary in difficulty, they 
are hardest for the L\ norm or the uniform norm." Indeed, in his 1955 paper |162j . Pollard added 
about two pages to deal with the L p case. 

In the special case of L2, there are connections to the classical moment problem. Let {st} be a 
sequence of real numbers. If there is a positive measure a on the real line such that 

/oo 
t k da(t), £ = 0,1,2,... , (1.4) 
-00 

then we say that the Hamburger Moment Problem for {s^} has a solution. If there is only one 
solution a, of the equations (1.4), we say the moment problem is determinate. Existence of a is 
equivalent to positivity of certain determinants. The determinacy of the moment problem is often 
a deeper and more difficult issue. If we define 

T,(z) = sup jjP^)! 2 : P a polynomial with J \P\ 2 da < 1^ , 

then a necessary and sufficient condition for determinacy is [67\ p. 141] 

log+ E(t) 



1 + t 2 



-dt = 00. 
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Here log + x := max {0, log x}. In particular, for the measure 

da(t) = exp(-\t\ a )dt, 

one can use this to show that the corresponding moment problem is determinate iff a > 1. Recall 
that Bernstein's approximation problem also had a positive solution iff a > 1. 

There are still closer connections to Bernstein's problem [38} Thm. 4.3, p. 77]. It is noteworthy 
that this was obtained by M. Riesz in 1923 even before Bernstein's problem was formulated. 

Theorem 1.7 Suppose that a is a positive measure on the real line, with hnite moments {s^, as in 
(1.4). Suppose, moreover, that a has no point masses, so that x i— > J_ da is a continuous function 
of x. Then the moment problem associated with {sfc} is determinate iff for every a-measurable 
function f with J f 2 da finite, and for every e > 0, there exists a polynomial P such that 

/oo 
(/ - P) 2 da < e. 
-oo 

Without assuming the continuity of J da, determinacy still implies density of the polynomials 
[381 P- 74 ff.]. There is also an important result on one-sided approximation [38} Theorem 3.3, 
p. 73]: 

Theorem 1.8 Suppose that a is a positive measure on the real line, with finite moments {s/c}, as 
in (1.4). Suppose moreover, that the moment problem associated with {sk} is determinate. Let 
e > and / : R — > M be a function that is Riemann-Stieltjes integrable against da over every finite 
interval, and improperly Riemann integrable over the whole real line, and of polynomial growth at 
oo. Then there exist polynomials R and S such that 

R< f < S in R 

and 

/oo 
(5 - R)da < e. 
-oo 

Some recent work related to Bernstein's approximation problem and its L2 analogue appear in 

is], 0, 0, [79], [eh], Hsu, [TB3], ieh, ra, ra, ra, nzg, ess]. 

2 Some Ideas for the Resolution of Bernstein's Problem 



In this section, we present some of the ideas used by Akhiezer, Mergelyan, Pollard, to resolve Bern- 
stein's Approximation problem. It's relatively easy to derive the necessary parts of the conditions. 
We follow [67J p. 147 ff.] and [861 P- 28 ff.]. Recall Mergelyan's regularization of W : 

0,(z) = sup I |P(z)| : P a polynomial and sup - — - — < 1 

In the sequel, C,C\,Ci, . . . denote positive constants independent of n,x,t and polynomials P of 
degree < n. The same symbol does not necessarily denote the same constant in different occur- 
rences. We write C = C(a) or C / C(a) to respectively show that C depends on a, or does not 



Weighted Approximation 



7 



depend on a. We use the notation ~ in the following sense: given sequences of real numbers {c n } 
and {d n }, we write 

if for some positive constants C\, C2 independent of n, we have 

Ci < c n /d n < C 2 . 



Lemma 2.1 In order that Bernstein's approximation problem has a positive solution, it is neces- 
sary that for all z £ C\R, 

n(z) = oo. (2.1) 



Proof. Suppose that Bernstein's problem has a positive solution. Fix z G C\BL Since the function 
f(t) = (t — z)~ l is continuous, and fW has limit at 00, we can find polynomials {R n } such that 



5 n = \\(f-R n )W\\ 1 



0, 



(2.2) 



as n — > 00. Let 



Then for all t <E R, 



Pn{t)= l - {t - Z)Rn{t) ~U-Rn){t). 



Pn(t) 



W(t) 



(t-z) 

Next, for some constant C independent of t, we have 

t - z 



5„ 



l|/-i? n |(^)<l. 

On 



t-i 



Then 



\Pn{t)W{t)\ 

sup — = sup 



< C, teR. 

t-Z P n (t) 



t-i 



(t-z) 



W(t) 



< C. 



It follows that P n /C is one of the polynomials considered in forming the sup in Q(z), so 



n(z) > 



Pn(z) 



c 



1 



5 n C 



00, 



n — > 00. □ 



Let's think how we can reverse this to show the condition is also sufficient. If £l(z) = 00, 
then reversing the above argument, we see that there are polynomials {R n } satisfying (2.2), with 
f(t) = (t — z)~ l . If Q(z) = 00 for all z, we can approximate linear combinations 

m 

m = E 



J= i j 

by polynomials in the norm || • W||£, m). For small e, we can then approximate 



2e[(t-z)-e (t-z)+e\ (t - zf 
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and hence also l/(t — z) 2 . Iterating this, we can approximate l/(t — z) m for any non-real z, and 
m > 1. We can then approximate polynomials in l/(t — z). The latter can in turn uniformly 
approximate on the real line, any continuous function f(t) that has limit at oo. (Use Weierstrass' 
Theorem and make a transformation x = \/{t — c).) Since W < 1, we then obtain a positive 
solution to Bernstein's problem, for continuous functions that have limit at oo. This class of 
continuous functions is big enough to approximate arbitrary ones for the Bernstein problem. For 
full details, see [67l p. 148 ff. ]. 
Next, we show: 

Lemma 2.2 In order that Bernstein's approximation problem has a positive solution, it is neces- 
sary that 

H* 1 /"™-*.*. (2.3) 



, 1 + * 2 
Proof. The basic tool is the inequality 



valid for all polynomials P. To prove this, suppose first that P has no zeros in the closed upper 
half-plane. We can then choose an analytic branch of log P there, and the residue theorem gives 

1 f°° log P(t) , 
loe P(i) = - / ° V dt. 



Taking real parts, we obtain (2.4), with equality instead of inequality. When P has no zeros in 
the upper half-plane, but possibly has zeros on the real axis, a continuity argument shows that 
(2.4) persists, but with equality. Finally when P(z) contains factors [z — a) with a in the upper 
half-plane, we use 



, . . , ._ . l f°° log a-t , l f°° log o-t , 

log o - % < log \a - i \ = - / dt = - / dt 

7T ./_ no l + t Z 7T J_^ l+t 2 



and divide such factors out from P. So we have (2.4). Now if 

P(t)W(t 

~7T+t? 



sup^EP<l, (2.5) 



we obtain 



and hence 



i f°° log \p(t)\_ dt < i r° io g (i/w(t)) dt + J_ r log i 1 + t2 ) dt 



vr 7_ 00 1 + 1 2 - vr 7_ 00 1 + 1 2 2vr 7_ 00 1 + 1 2 

, .„,... 1 f°° log(l/W(t)) , 1 [°° log (1 + 1 2 ) , 

Taking sup's over all P satisfying (2.5) gives 

As we assumed Bernstein's approximation problem has a positive solution, Lemma 2.1 shows that 
= oo, and hence (2.3) follows. □ 
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With a little more work, this proof also gives 

io g n(t) 



oo 1 + * 2 



dt = oo, 



see [571 P- 153]. 

3 Weighted Jackson and Bernstein Theorems 

In the 1950's the search began for quantitative estimates, rather than theorems establishing the 
possibility of approximation. Bernstein and Jackson had provided quantitative forms of Weierstrass' 
Theorem in 1911 and 1912, two wonderful years for approximation theory. Recall first the classical 
(unweighted) case. Jackson and Bernstein independently proved that 

En [/]«, := a mf ||/ - PlU^-1,1] < ^ll/'llw-i,!]' (3- 1 ) 

acg(P)<n TL 

with C independent of / and n, and the inf being over (algebraic) polynomials of degree at most n. 
The rate is best possible amongst absolutely continuous functions / on [—1,1] whose derivative is 
bounded. More generally, if / has a bounded kth derivative, then the rate is 0(n~ k ). In addition, 
Jackson obtained general results involving moduli of continuity: for example, if / is continuous, 
and its modulus of continuity is 

to (/; 5) = sup {\f(x) - f(y)\ :x,ye[-l, 1] and \x - y\ < 5} , 

then 

En [/loo < Cu (/; 1 

where C is independent of / and n. 

Bernstein also obtained remarkable converse theorems, which show that the rate (or degree) 
of approximation is determined by the smoothness of /. These are most elegantly stated for 
trigonometric polynomial approximation. Let 

£ n [g] := inf \\g - i2|| Zoo[0)Jhr] 

denote the distance from a periodic function g to the set of all trigonometric polynomials R of 
degree < n. Let < a < 1. Bernstein showed that 

£ n [g}=0(n~ a ), twoo ^ cj(g;t) =0(t a ), t - 0+, 

where uj (g; ■) is the modulus of continuity of g on [0, 2tt], defined much as above. That is, the error 
of approximation of a 27r-periodic function g on [0, 2tt] by trigonometric polynomials of degree at 
most n decays with rate n~ a iff g satisfies a Lipschitz condition of order a. Moreover, if k > 1, 

£ n [g] = 0(n~ k - a ), twoo <=> u(g^ ;t) = 0(t a ), t^0 + . 

Here in the converse implication, the existence and continuity of the kth. derivative is assured. 
Bernstein never resolved the exact smoothness required for a rate of decay of n , or more generally 
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0(n- k ). The case k = 1 was solved much later in 1945 by A. Zygmund, the father of the Chicago 
school of harmonic analysis, and author of the classic "Trigonometric Series" [193J. Zygmund used 
a second order modulus of continuity. 

For approximation by algebraic polynomials, converse theorems are more complicated, as better 
approximation is possible near the endpoints of the interval of approximation. Only in the 1980's 
were complete characterizations obtained, with the aid of the Ditzian-Totik modulus of continuity 
|36j . An earlier alternative approach is that of Brudnyi-Dzadyk-Timan [3T] . We shall discuss only 
the Ditzian-Totik approach, since that has been adopted in weighted polynomial approximation. 
Define the symmetric differences 

A fc /(x) = /(x +£)-/(*-£); 
A 2 h f(x) = A h (A h f(x)); 

A k h f{x) = A h (A k ~ l f(x] 

so that 



A£/(s) = £ (*) (-!)«/(* + k\ - ih). (3.2) 



i=0 

If any of the arguments of / lies outside the interval of approximation ([—1,1] in this setting), we 
adopt the convention that the difference is 0. The rth order Ditzian-Totik modulus of continuity 
in L p [—1, 1] is 

u r v (f;h) p = sup [| A^ /j— 2 -/(z)||x p [-i ) i]. 

0<h<t 

Note the factor 

<p(x) = yl — x 2 

multiplying the increment h. This forces a smaller increment near the endpoints ±1 of [—1,1], 
reflecting the possibility of better approximation rates there. 

For 1 < p < oo, Ditzian and Totik |36|, Thm. 7.2.1, p. 79] proved the estimate 

dcg(P)<n n 

with C independent of / and n. This implies the Jackson (or Jackson-Favard) estimate [31\ p. 260] 

En[f] p <Cn-Vf^\\ Lv[ _ lM , 

n > r, provided /( r_1 ) is absolutely continuous, and the norm on the right-hand side is finite. 
Moreover, they showed that if < a < r, then [61\ p. 265] 

En[f] p = 0(n- a ), rwoo, (3.3) 

iff 

«> r v (f;h) p = o{h a ), h^o + . 

For example, if (3.3) holds with a = 3^, this implies that / has 3 continuous derivatives inside 
(— 1, 1) and /'" satisfies a Lipschitz condition of order 1/2 in each compact subinterval of (—1, 1). 
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This equivalence is easily deduced from the Jackson inequality above and the general converse 
inequality [361 Theorem 7.2.4, p. 83] 



(/; t) p < Mf Y, ( n + l Y~ lE n W P > * e (0, 1) 

0<n<i 



The constant M depends on r, but is independent of / and t. Of course, this subject has a long and 
rich history, and all we are attempting here is to set the background for developments in weighted 
approximation. Please forgive the many themes omitted! 

For weights on the whole real line, the first attempts at general Jackson theorems seem due to 
Dzrbasjan [37]. He obtained the correct weighted rates, but only when restricting the approximated 
function to a finite interval. In the 1960's and 1970's, Freud and Nevai made major strides in this 
topic |150j . That 1986 survey of Paul Nevai is still relevant, and a very readable introduction to 
the subject. 

Let us review some of the fundamental features discovered by Freud, in the case of the weight 
W a (x) = exp (— \x\ a ) , a > 1. A little calculus shows that the weighted monomial x n W a {x) attains 
its maximum modulus on the real line at Freud's number 



(n/a) 



l/a 



Thereafter it decays quickly to zero. With this in mind, Freud and Nevai proved that there are 
constants C\ and C2 such that for all polynomials P n of degree at most n, 

\\P n W a \\ Lp{R) < C 2 \\PnW a \\ L ^_ Cinl/a ^ Cinl/a y (3.4) 

The constants C\ and C2 can be taken independent of n, P n and even the L p parameter p 6 
[l,oo]. Outside the interval [— Cin 1 /", Cin 1//Q ] , P n W a decays quickly to zero. This meant that 
one cannot hope to approximate fW by P n W outside [— Cira 1 /", Cin 1 /"] . So either a "tail term" 
ll/Wallj-, r| x |>c in i/ai must appear in the error estimate, or be handled some other way. Inequalities 
of the form (3.4) are called restricted range inequalities, or infinite-finite range inequalities. 

The sharp form of these was found later by Mhaskar and Saff, using potential theory [136] . 
[138] . |139j . [140] . [167] . Let W = exp(— Q), where Q is even, and xQ'(x) is positive and increasing 
in (0,oo), with limits and 00 at and 00, respectively. For xi ^ 1, let a n — a n (Q) denote the 
positive root of the equation 

We call a n the nth Mhaskar Rakhmanov Saff number. For example, if a > 0, and W a (x) = 
exp(— \x\ a ), 



\2 



l/a 



Mhaskar and Saff established the Mhaskar— Saff identity: for polynomials P n of degree at most 
n, 

\\PnW\\ Lao( K) = \\P n W\\ Loo [_ an}an ]. (3.6) 
Moreover, if P n is not the zero polynomial, 

\\ P nW\\ Loo ( R \[_ an)an ^ < \\PnW\\ Lao \_ anAn } 
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and a n is asymptotically the "smallest" such number. 

There are L p analogues, valid for all p > 0. For example, if e > 0, there exists C > such that 
for re > 1 and polynomials P n of degree < re, 

||f , nW / ||i p (R\[-a„(l+e),o„(l+e)]) < e ~ Cn ll-Pri^lL^-a^] • (3-7) 

We shall discuss these more in Section 4.6. 

The next task is to determine what happens on [—Ca n ,Ca n ]. Now if we had to approximate 
in the unweighted setting on this interval, a scale change in the Jackson-Bernstein estimate (3.1) 
gives 

, loL \\f ~ P \\Loc,[-Ca n ,Ca n }<CCl-^\\f'\\ Lao y_Ca n ,Ca n \■ 
aeg{P)<n re 



Remarkably, the same is true when we insert the weight W a in both norms: 

, loL \\(f ~ P ) W »\\Lool~Ca n ,Ca n ] < C 3 -^\\f'W a \\ Loo[ _ Can:Can] . (3.8) 
deg(F)<n Tl 

Very roughly, this works for the following reason: it seems that if C is small enough, we can 
approximate 1/W a on [— Ca n ,Ca n ] by a polynomial R n /2 of degree < re/2, and then use the 
remaining part n/2 degree polynomial in P to approximate fW a itself on [—Ca n ,Ca n ]. In real 
terms, this approach works only for a small class of weights. Nevertheless, it at least indicated the 
form that general results should take. To obtain an estimate over the whole real line, Freud then 
proved a "tail inequality" , such as 



WfW a \\ Lp[lxl > Can] < C^\\f'W a \\ Lp(R) , (3.9) 



with C4 independent of / and n. Combining (3.8), (3.9), and that suitable weighted polynomials 
are tiny outside [— Cia n ,C\a n ] yielded the following: 

Theorem 3.1 Let 1 < p < 00, a > 1 and f : R — > R be absolutely continuous, with \\f'W a \\L (R) < 
00 . Then 

E n [f;W a }:= inf ||(/ - P)W a \\ L p(R) < C 5 — \\f'W a \\ Lp{R) , (3.10) 

dog(P)<n re 

with C5 independent of f and re. 

While this might illustrate some of the ideas, we emphasize that the technical details underlying 
proper proofs of this Jackson (or Jackson-Favard) inequality are formidable. Some of these ideas 
will be illustrated in the next section. Freud and Nevai developed an original theory of orthogonal 
polynomials for the weights W% partly for use in this approximation theory. 

We note that Freud proved (3.10) for W a for a > 2 [IT] . [42]. The technical estimates required 
to extend this to the case 1 < a < 2 were provided by Eli Levin and the author [80]. What about 
a < 1? Well, recall that the polynomials are only dense if a > 1, so there is no point in considering 
a < 1. But a = 1 is still worth consideration, and we shall discuss that below in Section 5. 

One consequence of (3.10) is an estimate of the rate of weighted polynomial approximation of 
/ in terms of that of /'. Indeed, if P n is any polynomial of degree < re — 1, then 



E n [/; W a ] p = E n [f- P n ; W a ] p < Cfc^H (/ - Pn)' W a \\ Lp(R) , 
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and since P' n may be any polynomial of degree < n — 1, we obtain the Favard or Jackson-Favard 
inequality 

E n [/; W a ] p < C 5 ^E n -! [/'; . (3.11) 

This can be iterated: 

Corollary 3.2 Let 1 < p < oo, r > 1, a > 1 and / : R — ► M have r — 1 continuous derivatives. 
Assume, moreover, that is absolutely continuous, with ||/^W a ||x, (r) < oo. Then for some C 
independent of f and n, 

E n [/; W a ] p < C 5 (^Y \\f^W a \\ Lpm . (3.12) 



Freud also obtained estimates involving moduli of continuity. Here one cannot avoid the tail 
term, and has to build it directly into the modulus. Partly for this reason, there are many forms 
of the modulus, and more than one way to decide which interval is the principal interval, and over 
what interval we take the tail. However it is done, it is awkward. We shall follow essentially the 
modulus used by Ditzian and Totik [36] , Ditzian and the author [33] , and Mhaskar [136] . All these 
owe a great deal to earlier work of Freud. 

The first order modulus for the weight W a has the form 

ui, P (f,W a ,t)= sup \\W a (A h f)\\ i i +inf \\(f- c )W a \\ / , x 

0<h<t L p [-hT=^ ,/lT=a] CSK L p R\[-tT=3 jt T=3 



Why the inf over the constant c in the tail term? It ensures that if / is constant, then the 

modulus vanishes identically, as one expects from a first order modulus. Why the strange interval 

i i 

[— h 1 -^ , /j !-<*]? It ensures that when we substitute 



n l/a 



h= — =n- 1+1 ' a , 



n 

then 



[-h?=z,hi=z] = [-n 1 /",^/"] = [-C x a n ,C x a n ], 
for an appropriate constant C\ (independent of n). More generally if r > 1, the rth order modulus 



is 



u r ,p{f,W a ,t) = sup \\W a (A r h f) 

0<h<t 



+ inf \\(f-P)W a \\ i _j_ i v (3.13) 

deg(P)<r-l LpRy-t^,!!-"] 1 



Again the inf in the tail term ensures that if / is a polynomial of degree < r — 1, then the modulus 
of continuity vanishes identically, as is expected from an rth order modulus. The Jackson theorem 
takes the form 

E n [/; WJ < Cuj r , p {f, W a , ^). (3.14) 
y n 

This is valid for 1 < p < oo, and the constant C is independent of / and n (but depends on p and 

W a ). 
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One can consider more general weights than W a . Almost invariably the weight considered has 
the form W = exp(-Q), and the rate of growth of Q has a major impact on the form of the 
modulus. Let us suppose, for example, that Q is of polynomial growth at oo, the so-called Freud 
case. The most general class of such weights for which a Jackson theorem is known is the following. 
It includes W a ,a > 1, but excludes W\. 

Definition 3.3 (Freud Weights) Let W = exp(— Q), where Q : R — > R is even, Q' exists and is 
positive in (0, oo). Moreover, assume that xQ'{x) is strictly increasing, with right limit at 0, and 
for some A, A, B > 1, C > 0, 

^w-* x> - c - (3 - i5> 

Then we write W € T . 

For such W, we take a n to be the positive root of the equation (3.5) (the existence and uniqueness 

of a n is guaranteed by the strict monotonicity of xQ'(x)). To replace the function t 1 ^, we can use 
the decreasing function of t, 

a(t) := inf \a n : — < t\ , t > 0. (3.16) 
In) 

The modulus of continuity becomes 

u r , P (f,W,t) = sup \\W (A r h f) \\ L p [- a (h),a(h)] + 
0</i<t 

The reader new to this subject will be encouraged to hear that this strange looking creature 
has all the main properties of more familiar moduli of continuity [33], [35], [36], [48] . [75], [136] : 

Theorem 3.4 (Properties of u) r , P ) Let W € T, r > 1, < p < oo. Let / : R — >• R, and if p < oo, 
assume that fW £ L p (R). If p = oo, assume in addition that f is continuous and that fW has 
limit at ±oo. 

(a) u) r ^ (/, W, t) is an increasing function of t > 0. 
(b) 

lim LJ r ,p(f, W,t) = 0. 

(c) Assume that p > 1, or assume that W admits a Markov-Bernstein inequality 

n 

\\P'W\\ Lpm < C- \\PW\\ L (R) , (3.18) 

valid for n > 1, and polynomials P of degree < n, where C ^ C{n,P). Then there exists 
d / Ci(t,/) such that 



w r , p (/, W, 2t) < Ciw rj ,(/, W, t), t > 0. 
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(d) Assume that p > 1. Then for t > 0, 



(f,W,t)<Cf\\f^W\\r 



provided f( r ^ is absolutely continuous and the norm on the right-hand side is finite. 

(e) Let r > 1. There exists C ^ C(f,t), such that 

w r , P (/, W, i) < CW_i, p (/, W, t), i > 0. (3.19) 

(f) Assume the Markov-Bernstein inequality (3.18). Then if q = min{l,p}, there is the Marchaud 

inequality 

( r ^ uj q ( f w iA ~\ l l q 

urMw^KCtf^ r+1 ;; g ; 1 ' } du + \\fw\\i p{ ^ , 

where C,(/,i), j = 1,2. 

(g) Assume that f3 > 1 and < p < q < oo. Then there is the Ulyanov type inequality 



u r , q (f,Wp,t) < C-{ jf [«« p U r ,p(f, W/3,u) 



, . t > 0, 
« J 



where C^C(f, t). 



(h) Assume that p > 1 and for some < a < r, 

Wr,p(/,W,t) = 0(t°), t^0 + . (3.20) 
Let = \_a\ , the integer part of a. Then exists a.e. in the real line, and 

u} r „ k , p (f {k) ,W,t) = 0(t a - k ), t^0 + . (3.21) 
If p = oo, is continuous on the real line. 

Proof, (a) This is immediate as o~(t) is a decreasing function of t. 

(b) This follows as for classical moduli of continuity. One first establishes it for suitably re- 
stricted continuous functions, and then approximates an arbitrary function by a continuous one. 

(c) This is part of Theorem 1.4 in |33|. p. 104]. 

(d) This is part of Corollary 1.8 in [331 P- 105]. 

(e) This follows easily from the definition (3.13) and the recursive definition of the symmetric 
differences. 

(f) This is Corollary 1.7 in [331 P- 105]. 

(g) This is part of Theorem 9.1 in [351 P- 133]. 

(h) See [361 PP- 62-64] for the analogous proofs on a finite interval. □ 
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Note all the strictures for p < 1. Fundamentally these arise because we cannot bound norms 
of functions in terms of their derivatives when p < 1. At least the Jackson theorem is the obvious 
analogue of the result for finite intervals [331 Theorem 1.2, p. 102]: 

Theorem 3.5 Let W E J-, r > 1, and < p < oo. Let fW £ L p (M). If p = oo, we also require f 
to be continuous and fW to have limit at ±oo. Then for n > r — 1, 

E n [/; W] p < durM W > C2— ), (3.22) 

where C\ and C2 are independent of f and n. 

In the case where p > 1, or W admits the Markov-Bernstein inequality (3.18), one can omit 
the constant C2 inside the modulus. This follows directly from Theorem 3.4(c). We shall say much 
more about Markov-Bernstein inequalities in Section 7. 

Corollary 3.6 Let 1 < p < 00, r > 1, W £ T and f : R — > R have r — 1 continuous derivatives. 
Assume, moreover, that is absolutely continuous, with \\f^W\\L (m < 00. Then for some C 
independent of f and n, 

E n [f;W] p < C 5 (^) r \\f^W\\ Lp(R) . (3.23) 

The converse inequality, which can be interpreted as a Bernstein type converse theorem, has 
the form [33j p. 105]: 

Theorem 3.7 Let < p < 00, r > 1, W G T . Assume that W admits the Markov-Bernstein 
inequality (3.18). Let q = min{l,p}. For t < a\, dehne the positive integer n = n(t) by 



n := n 



:=inf{fc: ^ < i} . (3.24) 
Then for some C ^ C (t, /) , 

[log 2 n] ■ rq 

u; r>p (f,W,ty<c(^) J2 — E 2J [f;W\l, (3.25) 

where we define E 2 -i ■= Eq and \ \0g2 n\ denotes the largest integer < log 2 n. 

From this one readily deduces: 
Corollary 3.8 Assume the hypotheses of the previous theorem, and let < a < r. Then 
uj r , p (f,W,t) = 0(t a ), t^0+ En [f;W} p = O^Y) , n^oo. 

A related smoothness theorem is given by Damelin [1 



One of the important tools in establishing this is iT-functionals and the concept of realization. 
This is a topic on its own. In the setting of weighted polynomial approximation, it has been explored 
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by Freud and Mhaskar, and later Ditzian and Totik, Damelin and the author. See [TT], [27], [33J, 
[Sj, [35], |136j . |137j . In our context, an appropriate K-functional is 

K rJt (f,W,t r ) := inf {||(/ - g)W\\ Lp{R) +t r \\g^W\\ Lpm } , t > 0, 

where the inf is taken over all g whose (r — l)st derivative is locally absolutely continuous. It works 
only for p > 1, again because of the problems of estimating functions in L p , p < 1, in terms of their 
derivatives. The X-functional is equivalent to the modulus of continuity in the following sense |33|. 
Thm. 1.4, Cor. 1.9, pp. 104-105]: 

Theorem 3.9 Let W G T and p > 1. Then for some C\, C% > independent of f, t, 

CW r>p (/, W, t) < K rjp (/, W, f) < CWr,p(/, W, t). 

The appearance of fairly general functions g in the inf defining K TjP helps to explain its use- 
fulness. In fact, many of the properties of the modulus described above, go via the K-functional. 
When p < 1, the K-functional is identically zero, so instead we use the realization functional, 
introduced by Hristov and Ivanov [50] and analyzed by those authors and Ditzian |32j : 

K r , p (/,^,^) : =inf{||(/-P)W|| Lp(K) +^||^ (r) W||L p (M)}, *>0. 

Here the inf is taken over all polynomials P of degree < n(t) and n(t) is defined by (3.24). For 
p > 1, K r>p and K r>p are equivalent [32]. For all p, one can prove [351 Thm. 1.4, p. 104]: 

Theorem 3.10 Assume W £ J- and W admits the Markov-Bernstein inequality (3.18). Then for 
some Ci, C2 > independent of f, t, 

CVr, P (/, W, t) < K r , p (/, W, f ) < C 2 cu riP (f, W, t). 

Observe that if we choose t = a n /n, the above result actually gives more than the Jackson 
estimate Theorem 3.5, at least when ak/k decreases strictly with k. We obtain 

a. { |k/ - p)w,|ii '<" + ||p,r,M ' || ^"«} s c ^ } - w - 

so there is an automatic bound on the rth derivatives of best approximating polynomials. 

The Freud weights above have the form W = exp(— Q), where Q is of polynomial growth at 00, 
with Q(x) growing at least as fast as \x\ a for some a > 1. For the special weight exp(— |x|), the 
polynomials are still dense, but we have not established anything about the degree of approximation. 
We shall devote a separate section to this. The case where Q is of faster than polynomial growth 
is often called the Erdos case, and also has received some attention. The main difference is that 
the modulus of continuity becomes more complicated. Here is a suitable class of Erdos weights: 

Definition 3.11 (Erdos Weights) Let W = exp(— Q), where Q : E — > E is even, Q' exists and 
is positive in (0, 00). Assume that xQ'ix) is strictly increasing, with right limit at 0, and the 
function 

T W := *M (3.26) 
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yQ'iv) < r (Q(y)\° 2 

xQ'(x)- ^Qix)) 



is quasi-increasing in the sense that for some C > 0, 

< x < y => T(x) < CT(y), (3.27) 

while 

lim T(x) = oo. 
Assume, moreover, that for some C\, C2 and C3 > 0, 

y>x>C 3 . 

XleJ [X) \^{X) / 

Then we write W G 6. 

Examples of weights in this class include 

W{x) = exp(- ex$ k (\x\ a ) + exp fc (0)), 

where a > and for k > 1, 

exp fc = exp(exp(- • • exp())) 
denotes the A:th iterated exponential. We set exp (x) = x. For this weight, 

k-l 

T(x) ~ ax a Yl 

for large x [851 p. 9], while the Mhaskar-Rakhmanov-Saff number has the asymptotic [85, p. 29] 

log fc -i ^log n - \ J2 ^ n + O(l) \ J = (log* nf a (1 + o(l)) , 

n — ► 00; log k denotes the A;th iterated logarithm. 

For general W G £, the modulus of continuity involves the function 



where a(t) is as in (3.16). One may think of this as an analogue of the function y/l — x 2 which 
appears in the Ditzian-Totik modulus of continuity, and it appears for the same reason: in ap- 
proximating by polynomials with Erdos weights, the rate of approximation improves towards the 
endpoints of the Mhaskar Rakhmanov Saff interval. The modulus is 

UJ r ,p(f,W,t) = SUp ||W(a?) ( A£ 4 t f x )f(x)) \\ L p [- a (2t),a(2t)] 
0<h<t v 7 

+ dcg (M-i 11 (/ " p ^ w ^\i-«m,*my ( 3 - 28 ) 

Once one has this modulus, the Jackson estimate goes through [271 Thm. 1.2, p. 337]: 
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Theorem 3.12 Let W G E , r > 1, and < p < oo. Let fW G L p (R). If p = oo, we also require 
f to be continuous and fW to have limit at ±oo. Then for n > 1, 

£n [/; W% < Ctu r , p (f, W, Cfe— ), (3.29) 

where Ci and C2 are independent of f and n. 

There are also converse estimates |11| . [16j . |27j . The details are more difficult than for Freud 
weights, because of the more complicated modulus. Analogous results for exponential weights on 
(—1,1) are given in |15| . [92], [93]. For exponential weights multiplied by a generalized Jacobi 
weight or other factor having singularities, see [49j . |120| . j!21| . [122] . [178] . For Laguerre and 
other exponential weights on (0, 00), see [28J, [62], |115j . Geometric rates of approximation in the 
weighted setting have been explored in [70], p27], [El], [135], [136]. 



4 Methods for Proving Weighted Jackson Theorems 

In this section, we shall outline various methods to prove weighted Jackson Theorems, but will 
not provide complete expositions. In the unweighted case, on [—1,1], many of the elegant meth- 
ods involve convolution operators. However, unfortunately these depend heavily on translation 
invariance, so fail for the weighted case. We begin with two of the oldest, used by Freud and Nevai. 



4.1 Freud and Nevai's One-sided L\ Method 

The L\ method is primarily a tool to obtain estimates on the degree of approximation for special 
functions such as characteristic functions. Once one has it, one can use duality and other tricks 
to go to Lqo, and then interpolation for 1 < p < 00, and this will be done in the next section. 
The method is based on the theory of orthogonal polynomials, and Gauss quadratures. Under the 
tutelage of Geza Freud and Paul Nevai, the two subjects of weighted approximation and orthogonal 
polynomials for weights on the real line, developed in tandem throughout the 1970's and 1980's. 
Mhaskar's monograph [136] provides an excellent treatment of the material in this and the next 
section. 

Corresponding to the weight W, we define orthonormal polynomials 



Pn(x) = p n (W 2 ,x) = 7„x n + 



where j n > 0, satisfying 



PnPmW 2 = 5, 



2 _ 

mn. 



Note that the weight is W 2 , not W. This convention simplifies some formulations later on. Let us 
denote the zeros of p n by 

The rath Christoffel function for W 2 is 

f°° (PW) 2 

X n (W 2 ,x)= inf J -°° K . (4.1) 

dcg(P)<n-l P 2 {X) 
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It also satisfies 

n-1 
3=0 

Try this as an exercise: expand an arbitrary polynomial of degree < n — 1 in {pj}™Zoi an d then 
use Cauchy-Schwarz, and orthonormality. 

Many readers unfamiliar with the detailed theory of orthogonal polynomials, will nevertheless 
have seen the Christoffel functions in the Gauss quadrature formula: 



/oo w 
PW 2 = Y,^n(W 2 ,X Jn )P( Xjn ), 
"OO -i 



i=i 

valid for all polynomials P of degree < 2n — 1. For fixed £, one can also develop similar quadrature 
formulae, based on the zeros of 

Pn(x)p n - 1 {C) - Pn-l{x)p n {£). 

These zeros are real and simple and include £. There are important inequalities, the Posse-Markov- 
Stieltjes inequalities, that are used to analyze these quadratures. In the simple Gauss case, they 
assert that if / is a function with its first 2n — 1 derivatives positive in (— oo,Xk n ), then 

K(W 2 ,x jn )f(x jn )< / fW 2 <^2\n(W 2 ,x jn )f(x jn ). 

j=k+l J -°° j=k 

See [HU p. 33], \136\ p. 13]. This may be used to prove [1361 p. 17]: 

Lemma 4.1 Let n > 1, and £ G (xk+i,ni x kn\- Then there exist upper and lower polynomials i?g 
and such that 

and 

/oo 
(i?£ - r c ) W 2 < X n (W 2 ,x kn ) + \ n (W 2 , x k . 
-oo 



Thus once we have upper estimates on the Christoffel functions, we have bounds on the error of 
one-sided polynomial approximation. One could write a survey on methods to estimate Christoffel 
functions, there are so many. Paul Nevai paid homage to them and their applications in his still 
relevant survey article |150j . as well as in his earlier memoir [148]. We shall present a very simple 
method of Freud in a very special case: 

Lemma 4.2 Assume that W = exp(— Q) G T , and in addition that Q is convex. Then there exists 
C\,C% > such that for n > 1 and |£| < Cia n , 

\ n {W 2 ^)<C 2 ^W 2 {i). (4.2) 
n 
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Proof. Let 



3=0 " 

denote the mth partial sum of the exponential function. Fix £, n and let 

R(x)=W-\Qe W2] {(x-OQ'(0)- 
Here, [n/2\ denotes the integer part of n/2. Now 

|e[n/2j(«)| < C 2 exp(u), 
for \u\ < |. Moreover, given e > 0, we have if |x| < 2a n , and |£| < ea n , 

|(x-0Q'(0|<3a n Q'(ea n )<- ) 



(4.3) 



if only £ is small enough. To see this, we use the definition of a n and the monotonicity of t <— > tQ'(t) : 
_ 2 j" 1 antQ'{a n t) dt > 2 y 1 a n , /a^x 



vr i v 7 !^? vr / 1/2 ^T^i 1 2 



Thus 



«nQ' (^) < Cm. 
Using the lower bound in (3.15) of Definition 3.3, we obtain for m > 1, 

anQ i (A— ^) < ,4- m a„Q' (^) < C^n. 
There we had A, A > 1, so choosing e = A~ m /2 with large enough m, gives (4.3). Next, 

|i?(x)| < C 2 lU- 1 (0exp((x - 0Q'(0) = C 2 exp(Q(0 + (x - Otf(0) < C 2 exp(Q(x)), 
by convexity of Q. Thus 



Moreover, 



\R(x)\W(x) < C 2 for all \x\ < 2a n . 



(RW)(0 = 1. 



Now, we use the extremal property (4.1) of Christoffel functions. We set P = Rp, there, where p 
is a polynomial of degree < [n/2\ . We see that 



K(w\o/w 2 (0 = inf 

deg(P)<n 



< inf 

dcg(p)<Ln/2j [j 

— oo 



(PWO 2 

/oo 
(piW) 
-oo 



/(pwf(0 

/(pRW) 2 (0 



< C inf 

dcg(p)<Ln/2j 



2<l n 



{pRWf 



2a n 



/(pRW)\0, 
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by the restricted range inequality (3.7). Now we use the upper bound on RW and (RW)(£) = 1 to 
continue this as 

< CC 2 inf ( I 2 *™ p 2 ] /p 2 (0 = CCpa n inf ( C S 2 ] / S 2 ( -L) . 

" 2 deg(p)<Ln/2jVy-2a„ 7 deg(5)< [n/2\ \J_j_ J' \2d n ) 

The latter is the Christoffel function of order [n/2j for the Legendre weight on [—1,1], evaluated 
at £/(2a n ). Using classical estimates for these [38] P- 103], we continue this as 

x n (w 2 ,o/w 2 (0<c^. □ 

n 

Now we present a very special case of the L\ approximation: 

Theorem 4.3 Assume the hypotheses of Lemma 4.2 on W. Assume that f is continuous, and 
f'W 2 £ L\ (R) , while f is of polynomial growth at oo . Then there exist upper and lower polynomials 
S n and s n such that 

s n <f<S n in R (4.4) 



and 



r ( Sn - Sn) w 2 <c^(r i/'i w 2 + wfw 2 \\ Loc{lxl > Can) ) . 

J — oo \«/ — oo / 



Main idea of the proof. We assume that /' ^ 0, and that f — outside (x nn ,xi n ) to simplify 
the proof. Recall that x nn and x\ n are the largest and smallest zeros of p n (W 2 ,x). We write the 
fundamental theorem of calculus 

/(x) = /(o)+ f x nm 

Jo 

in the form 

/•oo rQ 

f( X ) = /(0) + / (1 - X(-oo, f ](^))/ / (0^ - / X(-ood X )f'^. 
JO J-oo 

To check this, consider separately x > and x < 0. We use the upper and lower polynomials R^ 
and r£ of Lemma 4.1 and define 

/•oo p0 

S n (x) = /(0) + / (1 - r c 0c))/'(£K " / rz(x)f\£)d£ 

JO J-oo 

and 

/■oo r0 

s n (x) = /(0) + / (1 - Rz(x))f(0<% ~ / Rt(x)f'(£)d£. 

Jo J -co 

As /' > and < X(-oo,£] < we obtain (4.4). Moreover, 

/•oo 

(S n -s n )(x)= / (R^-r^x) 



Weighted Approximation 



23 



so 



/oo poc roo 

(S n -s n )W 2 < / / (R^-r^(x)W 2 (x)dx 
-oo J —oo \_J — oo 

Now by Lemmas 4.1 and 4.2, and some other minor considerations, 

f'°° n 
/ - r € )(x)W 2 (x),ix < C— T¥ 2 (,£), 

J — oo ^ 



at least for |£| < Ca n . Thus, 

/oo /-Can 
-oo ^ J— Ca„ 



'w 2 



+ ll/"^ 2 ! 



Loo(M>Ca n ) 



IS I > Car. 



W (0 / -r $ )(x)W^(s)da 



di. 



The first term on the right-hand side has the correct form. The second tail term does not. One 
can still use estimates for Christoffel functions, and some other tricks. See [40], [46] or [136^ p. 83 
ff.] for more details. □ 

This is what Freud proved in 1974 [40 \ p. 297], using these ideas: 

Theorem 4.4 Let Q be an even and convex function on the real line. Assume that Q is increasing 
and Q' exists in (0, oo), and that for some C\,C<i > 0, 

Q'(2x) 

1+Cl< -Q^y <1 + ° 2 - 

Let r > and be of bounded variation over every finite interval, and of polynomial growth at 
oo, satisfying for some A, B > and integer m, 

\f{x)\ <A + Bx 2m . 

Then there exist upper and lower polynomials S n and s n of degree < n such that (4.4) holds and 

(S n - s n )W 2 < C (^) ( W 2 \df^\+A + B\. 



The constant C is independent of f, n, A, B. 

While this is a one-sided L\ result, it also implies 



E n [f;W\ <c(^) r+l (T° W 2 \df^\+A + B^ . 

This can be extended to other L p spaces, using duality. See [120] for an extension to weights with 
inner singularities. 
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4.2 Freud's Method involving de la Vallee Poussin Means 

This is based on orthogonal expansions, and for the finite interval, was used before Freud. Freud 
was the first to make it work on infinite intervals [H], [22]. For a measurable function for which 
f^^ifW) 2 is finite, we can form the orthonormal expansion 

oo 
j=0 

where 

poo 

f Pj W 2 , j > 0. 

> 

Define the partial sums of the orthonormal expansion, 

n-1 



Sn[f] = Y, c iPr 

3=0 

The classic minimum property asserts that 

\\{f - S n [f])W\\ Lm = E n ^[f-W] 2 = inf \\(f-P)W\\ Lm , 

deg(P)<n— 1 

and the classic Bessel's inequality asserts that 

(l \ 
En 2 ) <\\fw\\ Lm - 

In particular, {S n } is a uniformly bounded sequence of operators in a weighted L 2 space. 
Sometimes it is more convenient to use the de la Vallee Poussin operators 



In 



j=n+l 

We see that still 

ll^[/]W|| i2(R) <||/W|| L2(R) 
so the {V n } are uniformly bounded in this weighted L 2 setting. They also reproduce polynomials: 

V n [P] = P 

if P is a polynomial of degree < n — 1. However, they are not projection operators, as V n [P] is a 
polynomial of degree 2n — 1 in general. The real advantage is that in spaces other than L 2 , {V n } 
is often uniformly bounded when {S n } is not. 

Another crucial ingredient are estimates for the Christoffel functions defined by (4.1). Let us 
outline some of the main ideas of Freud's method to prove the Jackson-Favard inequality 



E n [f;W] p <C^E n ^ [f';W] p . (4.5) 
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(a) We show that {V n } is a uniformly bounded sequence of linear operators, first in L^, and then 
in L\ (using duality) and then in L p , 1 < p < oo (using interpolation). 

(b) We approximate a very special function, something like a characteristic function, in L\ } using 
the method of the previous section. 

(c) We prove the estimate in L^, using {V n } and the special function. 

(d) We extend to L p , 1 < p < oo. 

In our outline, we shall do this only for p = oo. 

The first step is to show, for some C ^ C(n, /), that 

E 2n [/; W}^ < ||(/- V n [/]) W\\ Loo(M) < CE n [/; W}^ . (4.6) 
Lemma 4.5 The inequality (4.6) holds, provided we assume the estimate (4.13) below. 

Proof. The left inequality here is immediate, as V n [f] is a polynomial of degree < 2n — 1. It is 
the right-hand inequality that requires work. We follow Freud [12]. The idea goes back at least to 
Torsten Carleman. The partial sum s m [f] admits the representation 

m— 1 /"oo 

s m [/] (x) = C 3PA X ) = / f(t)K m (x,t)W 2 (t)dt, (4.7) 

where 

m— 1 

K m (x,t) := J2pj(x)pj(t). 
The Christoffel-Darboux formula 1 38 1 asserts that 



K m (x,t) = . (4.8) 

7m X - t 

We now fix x and n, and define 

9 = f{ x )X( x -2rL x+2^l) 

\ n ' n J 

and 

h = f — g = fXm( x -Bn, x+ <izL)- 

Here denotes the characteristic function of a set S. Thus g = / in an interval of radius ^ about 
x, and is elsewhere, while h is the "rest" of /. We have 

Sm [/] (x) = s m [g] (x) + s m [h] (x). (4.9) 
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Now 



\s m [g]\ (x) 



f(t)K m (x,t)W 2 (t)dt 



2a ^ 1/2 



1 n 

^ \K m (x,t)\W(t)dt 



1/2 



Ki(x,t)W 2 (t)dt 



by the Cauchy-Schwarz inequality. Here, using orthogonality, 

poo 

/ K 2 m {x,t)W 2 {t)dt < / K 2 m (x,t)W 2 {t)dt 

Jx-S^ J-oo 



-oo 
m— 1 



2n-l 



J'=0 j=0 



provided m < 2n. Thus for m < 2n, 

\s m [g](x)\ < \\fW\\ Laom i^X-^W 2 ,x)j , 
and hence, averaging over s m , n + 1 < m < 2n, 

\V n [g] (x)\<\\fW\\ L ^ m [—\- 2 ,l{W 2 ,x) 
Now comes the clever idea. Let 



2a 



n 



1/2 



H(t) 



x — t ' 



and denote its Fourier coefficients with respect to {pj} by {cj [H]}, so that 

oo 
3=0 

We use the Christoffel-Darboux formula (4.8) to write 

7m- 1 



(4.10) 



\s m [h] (x)\ 



7m 

7m- 1 
7m 



Pm{X) 



/oo 
H(t) Pm (t)W 2 (t)dt 
-oo 

\Pm(x)Cm-l [H] -p m -l{x)c m [H] \ . 



Then summing and using Cauchy-Schwarz, 



\V n [h] ( X )\ < - 



7m- 1 

max 

m<2n r y m 



2n 



\ V2 , 2 „ x 1/2 

2 ( E E 



(4.11) 



vm=0 



\m=0 
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Yet another clever idea: use Bessel's inequality on the Fourier coefficients of H (recall, this holds 
for any orthogonal system in any inner product space), 



^ n 00 poo r 

c 2 m (H) < Yl c2 m( R ) < / ( HW ) 2 = L 

m=0 m=0 J ~ oc J { 



(fmm' dt<UW f L 2 jl. 

{t#-*l>?} \ t-x J " L -V> a n 



Then (4.11) becomes 



\V n [h] (x)\ < 



2^2 



n 



7m- 1 



max 

m<2n 'y n 



1/2 



Finally, combining this with (4.9) and (4.10) gives 



|K [/] (x)\ < WfW\\ Loo(R) {\ 2n+l {W\x)y 112 H^L j + ~ 



1/2 



1 



7m- 1 



max 

m<2n "f n 



2V2 { — 



1/2' 



(4.12) 



Up to this stage, we have not used any properties of the weight, it's completely general. But now 
we use lower bounds for the Christoffel function 



and an upper bound 



XJW 2 ,x) > C— W 2 (x), igR, n>l, 
n 



7m- 1 , s-, 

max <Co„ 



(4.13) 
(4.14) 



m<2n r J rl 

to deduce that uniformly for n > 1, and i£R, 

\V n [f) (x)\W(x)<C\\fW\\ Lacm . 

Thus, for some C* 7^ C(n, /), 

\\Vn[f]w\\ Loom <a\\fw\\ Loo{u) . 

Then for any polynomial P of degree < n, the reproducing property of V n gives 

\\(f-V n [f])W\\ Loom = \\(f-P- Vn [f-P})W\\ Lacm 

< IK/ - P)w\\ Laoim + UK [/ - p] w|| Loo(R) 

< (i + c *)\\(f-p)w\\ LooiRr 

Taking the inf over all P of degree < n gives the right-hand inequality in (4.6). 

We turn to the discussion of (4.13) and (4.14). For the latter, we can just use restricted range 
inequalities: if m < 2n 



7m- 1 

7m 



Xp m - 1 (x)pm (x) W 2 (x) dx 



< C 



2a„ 



2o„ 



\xp m -i{x)p m (x)\ W 2 (x)dx < C2a r 



by Cauchy-Schwarz. The lower bound (4.13) is more difficult; see for example [83], |136j . [150J. □ 
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As an aside, let us see how duality also gives this bound in L\. We use duality of L p norms: 

/oo 
V n [f]gW 2 , (4.15) 
-oo 

where the sup is over all measurable g with Ha^Hl^r) < 1. (Sorry, we used g above in a different 
sense.) Now we use the self-adjointness of V n : 

/OO /'OO 
V n [/] gW 2 = / fV n [g] W 2 . (4.16) 
-oo J —oo 

This follows easily, once wg prove th.6 sclf-adjointness of s m '. 

/OO f-OO 
s m [f}gW 2 = / fs m [g]W 2 . 
-oo J —oo 

We leave the latter as an exercise (just substitute in the definitions of s m [/] and s m [g]). Combining 
(4.15) and (4.16) gives 

/oo 
fv n [g] w 2 < sup \\fw\\ Lim \\v n [g] w\\ Lxm 
-oo 9 

< C\\fW\\ Lm , (4.17) 

using the case p = oo. With this bound, we can finish off the rest of the proof as we did for the 
case p = oo. For 1 < p < oo, one can use interpolation of operators. It is possible to improve the 
estimate (4.6) to include factors that vanish near ±a n , reflecting improved approximation there, 
see [101 j . A partially successful attempt to extend (4.5) to general exponential weights was given 
in [102] . 

An obvious question is for which weights, we have the lower bound (4.13) for the Christoffel 
functions. They have not been established for the class T of Definition 3.3. However, they were 
established in [83] for the following class: 

Definition 4.6 Let W = exp(— Q), where Q" exists and is positive in (0, oo), while Q' is positive 
there, with limit at 0, and for some A,B > 1, 

A-l<^^<B-l, x€(0,oo). (4.18) 

Then we write W £ T* . 

For weaker (but difficult to formulate) hypotheses, these estimates were proved in [85J. An 
excellent exposition is given in [136, Chapter 3]. The Christoffel function bound (4.13) there is 
proved assuming Q" is increasing. This is true for W = W a if a > 2, while (4.18) holds for a > 1. 

The next step is an inequality that is a distant relative of Hardy's inequality: 

Lemma 4.7 Let n > 1 and g be a function such that 

g PW 2 = 0, (4.19) 
for all polynomials P of degree < n. Then there exists C ^ C{n,g) such that 



sup W(x) 



X 

g 







< \\ 9 W\\ Loom . (4.20) 
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Main idea of the proof. Fix x > 0, and let 

Mt) = w-\t) X[0 , x] (t). 

For g as above, and any polynomial P of degree < n, 



g{t)(l> x {t)W 2 {t)dt 

) 

g(t) [cf> x (t) ~ P(t)] W 2 (t)dt 



/oo 
\<t> x -P\w. 
-oo 



Taking the inf over all such P gives 



Once we have the estimate 



W{x)E n [<f> x -W] l <C—., 
n 



(4.21) 



for some C/C(n,i), the result follows. To prove this, one can use one-sided approximation as in 
Section 4.1. See, for example, [HI p. 34]. □ 

Now we can give the Jackson-Favard inequality in the case p = oo: 



Theorem 4.8 LetW G J 7 *. Let f be absolutely continuous in each finite interval, with \\f'W\\ L 
hnite. Then for some C ^ C(n,f), 



and 



Proof. Let 



Enif-W^KC^E^t [f';W] 



(4.22) 
(4.23) 



g(x) = f'{x) - V n [f] (x). 
This does satisfy (4.19). Indeed, if P is a polynomial of degree < n, self-adjointness of V n gives 

/OO /"OO f'OO 

gPW 2 = / f'PW 2 - / V n [f] PW 2 
-oo J —CO J —co 

/CO f'OO 
f'PW 2 - / f'V n [P] W 2 = 0, 
-oo J —oo 



as V n [P]=P. Next, let 



U n {x) = f(0) + / V n [f] (t)dt, 
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so that 



Then 



(f- Un )(x)= / (f'-V n [f']) 



\W(f-U n )\\r 



W I g < C ^\\gWl 
n 

, Q>n ll / *./ tt- r \ ttt ii . &r, 



Here we applied Lemma 4.7 and then Lemma 4.5. This also gives, as U n has degree < 2n, 

£ 2 n [/^L < c^E n [f-w]^ < ||/^|| ioo(R) • 

Replacing n by re/2, and using the fact that a n /2 < a„, we obtain 

En[^WU<C^\\fW\\ L ^ y 

Finally, we observe that for any polynomial P of degree < n, 

E n [/; W)^ = E n [f- P; < || (/ - Pyw\\ Loom . 

Choosing P' suitably gives (4.23). □ 

For the extension to all 1 < p < oo, see [136, Chapter 4]. 

4.3 The Kroo-Szabados Method 

The idea here |69j is to make use of the alternation/equioscillation for best polynomial approxi- 
mants, together with some clever tricks. On a finite interval, the idea was used by Bojanov [5], 
Babenko and Shalaev. While it works quite generally, it does not yield the correct Jackson rate. 
Maybe some clever tweaking can repair that? 

Let us fix a function /, and n > 1, and let P* be its best polynomial approximation of degree 
< n in the weighted uniform norm. Thus 



\\(f-KWh 



inf \\{f-P)W\\ u 

deg(P)<n 



Then there exist equioscillation points {yj}™ =0 such that for < j < n + 1, 

Kf-K)W] (y J ) = e(-iyE n [f;W} 00 . 
The number e £ { — 1, 1} is independent of j. In terms of these, there is the determinant expression 



E n [f;W} € 



u 



f 1 ... x n 

yo yi ■■■ y n +i 



u 



g 1 . . . x n 

yo yi ■■■ Vn+i 
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where for functions {4>j} 



U 



and g is a function such that 



yo 2/i ••• y«+i 1 



ij=0 



W (l/j) = (-I)*, 0<j<n + l. 

A proof of this formula may be found in |170|, p. 28]. By some elementary determinantal manipu- 
lations, one can show that 



U 



f 1 ... x n 

VO V\ ■■■ Vn+l 



(-i) B £(-i)*[/(yjH-i)-/(w)]B* 



k=0 



and 



U 



9 1 

yo yi 



x'- 

Un+l 



(_l)n+l [^-1 {yk+l) + W -l (yfc) ] 



fc=0 



Here -Bfc is the determinant of a matrix with entries involving only {yj}. Then one obtains 



where 



E n [f;W] 



dk 



J2(-i) k [f(yk + i)-f(y k )]d k 



k=0 



(4.24) 



Suppose we define the (unusual!) modulus 

\m-m\ 



sup 



x,y&BL, \x-y\<t W(x)~^ + W(y) 7 ' 
where 7 E (0, 1). Then from (4.24), 

n 

E n [/; W]^ < ]T |4| (W (y fc+ ir 7 + W (y fc )" 7 ) ^ 7 (/; \y k+ i - y k \ 



k=0 



Now one uses properties of the modulus u> 7 and then has to estimate the {dk}- This involves tricks 
such as needle polynomials. Here is a sample of what can be achieved: 



Theorem 4.9 Let Q 

for large x, and with 



be an even continuous function, which is positive and differentiable 



n v ■ r*Q'(x) / r xQ'(x) 

< hmmf < limsup— — — — < 00. 

x-*oo Q(x) x ^oo Q(x) 



(4.25) 



Let ^ denote the inverse of Q, defined for sufficiently large positive x. For small n, take x ' (n) 
to be 1. Assume that < 7 < 1, and / : R — > R is continuous, with 

lim /(x)H/ 7 (x) = 0. 

[a; [—Mao 
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(a) Then for some Cj ^ Cj (/, n) 



^ [/; < CW/; " (1 ^ )n " ) + e-^" H/VHI W M) . (4.26) 



(b) Let < £ < i^. Tien for some C ^ C (f,n) , 

E n L/WL < C lW7 (/;Z;^ +£ ), (4.27) 

where for large enough n, 

I n :=j® {n) 9^L dt . (4.28) 

Note that (4.25) requires less than we required for W £ J 7 , but the restrictions on / are more 
severe than in Theorem 3.5, and instead of 2? inside the modulus, we obtain essentially a " '° g - . 
The boundary case Q(x) = \x\ satisfies the above conditions, and in this case we obtain 

E n [f;W]oc < Ciu 7 (f\ (logn)-^ +£ ). 

Here by choosing 7 small enough, the exponent of logn can be made arbitrarily close to —1. 

This method is interesting, and general. The challenge is how to tweak it, if possible, to get the 
correct Jackson rate. 



4.4 The Piecewise Polynomial Method 

This is undoubtedly the most direct and general method, and Ditzian and the author used it to 
prove Theorem 3.5. However, it does pose substantial technical challenges. For finite intervals, it 
has been used for a long time, and in spirit goes back to Lebesgue's proof of Weierstrass' Theorem. 
Lebesgue first approximated by a piecewise linear function, and then polynomials. It has served 
as a powerful tool on finite intervals, for example in investigating shape preserving polynomial 
approximation, the degree of spline approximation, and even approximation by rational functions 
[30], [T56] . 

The function / is first approximated by a piecewise polynomial (or spline) . Each of the piecewise 
polynomials is generated via Whitney's Theorem. Then special polynomials that approximate 
characteristic functions are used to turn the spline approximation into a polynomial approximation. 
We illustrate the method as it is used to prove Theorem 3.5 for Freud weights. 

Step 1: Partition [— a n , a n ]. Recall that our modulus uj r , P involves a tail piece that will take care 
of the behavior of f(x) for very large x. So we fix n and concentrate on approximation on the 
Mhaskar-Rakhmanov-Saff interval [—a n ,a n ]. We partition this interval into small intervals, all of 
length ^ : 

n 

-a n = t < n < r 2 < • • • < r 2n = an- 
Set 

l r .= [r jt r j+x ] t 0<j<2n-l. 
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Step 2: Use Whitney's Theorem to develop a piecewise polynomial approximation. 

We fix r — this will be the order of the modulus. We approximate / on Ij by a polynomial pj of 
degree < r, and then form the piecewise polynomial 

2n-l 

S [/] := PO + Yl (Pi - Pi-l) Xfa,a n ]> 

where, as usual, X[T,-,a„] denotes the characteristic function of the interval [rj,a n ]. When restricted 
to Jfc, the sum becomes a telescopic sum: we see that in the interior of Ik, namely in (rk,Tk+i), 

k 

S[f]=Po + Y^ (Pi ~ Pi-i) = Pfc- 
i=i 

Then, if p < do, 

2n-l ,. 2n-l 

ii(/-^w)^[-«.^]= E / i(/-w)wr < ^e^n p n/-Piii^)-(4.29) 

3=0 JI i 3=0 

In this step, we use the fact that because the intervals Ij have length W does not grow or decay 
by more than a constant. The idea is that for x,y £ 

W(x)/W(y) = exp(Q(y) - Q(x)) = exp(Q'(£)(x - y)) < exp (V^) < C, (4.30) 

since Q' is bounded by C^- throughout [— a n ,a n ]. (This is true for Freud weights in the class T* 
but fails for Erdos weights.) Now comes the application of Whitney. Let 

I*=IjUl j+1 . 

By Whitney's Theorem on the interval J| |156|, p. 195, p. 191], we can choose pj of degree < r such 
that 

\\f- Pj \\ P L( r^<C- / \Kf^)\ P dxds=:C^. 

L PVj) a n J Jj* j 

Here C is independent of f,n,j. The strange creature on the right is really the pth power of an 
rth order integral modulus of continuity on the interval Ij. In forming the rth difference in this 
integral, one uses the convention that the difference is taken as if any of the arguments of the 
function are outside the interval Ij. Substituting this in (4.29) and using (4.30) gives 

WU-S[f])W\\l n] < C- / \WA:f( X )fdxds 

u n JO J —On 

< C sup \\WA r h f\\ p T , , . (4.31) 

— n 

Step 3: Approximate the characteristic function X[r-,a n ]- Now comes the difficult part. We 
approximate X[Tj,a„] by a polynomial Rj of degree < Ln giving the polynomial 

2n-l 

p[f]=po+ E (pj-Pi-i)^ 

3=1 
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which is of degree at most Ln + r. Here the constant L is independent of n, and arises in the 
challenging task of generating Rj . We see that then 



2n-l 



s(f) -P[f]=J2 (Pi ~ Pi-i) - r j) ■ 



(4.32) 



To estimate this, we compare pj and Pj-i on the common interval where they approximate /, 
namely Ij. (That is why we used Whitney on I* = Ij U Ij+i-) We obtain, even for p < 1, 

\\Pj ~ P3-^\\l p (h) - C ("j" 1 + n i) • 

Now we use Nikolskii inequalities, which compare the norms of polynomials in L p and L q , and the 
Bernstein- Walsh inequality, which bounds the growth of polynomials outside an interval, once we 
know their size on the interval. Together they yield for all real x, 



\p j -p j - 1 \(x)<c(^-) 1/P (l + f 



X-Tjl) + % 



(4.33) 



Again, the constant C does not depend on x, or n, or /. It does however depend on r, which 
crucially remains fixed. The ^ factor arises from the length of ij. Suppose now that for a given 



X[Tj,a„ 



— 



(x) 



W(x) 



n 



< C 1 + — Is - r,- 



x 6 



(4.34) 



where C is independent of f,n,j,x. Then substituting this and (4.33) into (4.32) gives 

l/p 2n-l / \ 



\S[f]-P[f]\(x)W(x) <C 



From here on, we need to proceed a little differently for p < 1 and p > 1. Let us suppose p > 1. 
By Holder's inequality, with g = ^j-, 



|S[/]-P[/]| p (x)W(x) < C 



2n-l 



n 



X — Ti 



(r-e)p/2 



W^Tj)^ 



2n-l 

£(' + 

J'=l 



n 



(r-i)q/2 



p/q 



(4.35) 



We also use the fact that 1 + ^- |x — Tj| is bounded by a constant times 1 + ^ |x — Tj-±\ throughout 

/ \(r-£)q/2 

the real line. Next, if £ > r, the function u i— > ( 1 + ^- \x — u\ j is increasing in (— oo, x) and 

decreasing in (x, oo), so we can bound the second sum by an integral: 



2n-l 



X — To 



(r-l)q/2 



n 

< 2— 



1 H x — u 



(r-e)q/2 



du + l<C, 
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with C ^ C (n,x), provided only (r — £) q/2 < —1. So all we need is that I is large enough. Now 
we integrate (4.35): 



2n— 1 / 

\\(s[f]-p[f])w\\l m < c^J2 WP (^ nP j 1 + 

j=l J —oo \ 

2n-l 

again, provided t is so large that (r — £)p/2 < — 1. Finally, 

2n-l ran/n f2n-l 



n 
a,, 



{r-t>p/2 



dx 



(4.36) 



ds 



J^WfojnJ = -/ Y. W ' P ^) \Kfix)\ P dx 

|W(x)AJ/(x)Pcfa = C^sup^ \\WAlf\\l p[ _ anAn] 

Combining this, (4.31), and (4.36), gives 



< C sup 

0<h<3^ J-a n 



\\U-P[f])w\\ 



P T rm <c\ sup \\WA r h f\\ p r r , + n , 

L P (R) - 1 0<h <kn Hip[-an,an] IIJ n -MM>«n) 



That's it! Reformulating this in terms of the modulus of continuity is relatively straightforward. 
Of course we assumed: 



Step 4: Construction of the {Rj}- Recall we want Rj of degree < Ln, satisfying 

W(x) ( n \~ e 
\X[r,a n ]- Rj\(x)-^^ < C \1 + — \x - t\\ , x eR, t e [-a n ,a n ] . 



(4.37) 



with constants independent of T,n,x. The problem here is the W(t) in the denominator. It's tiny 
for r close to a n , and we want Rj to approximate 1 in [r, a n ], and to approximate elsewhere in 
R. One starts with an even entire function 

oo 

G(x) = J292jX 2 > 

j=0 

with all g2j > such that 

Ci < {GW){x) <C 2 , x G E. 
Such functions were constructed in [87] • We let G n denote the nth partial sum. One can show that 

(G n W)(x)<C 1 , xeR, 

and 



Ci < (G n W) (x) < C 2 , \x\ < Cia n . 
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Next, we need needle or peaking polynomials V n ^, built from Chebyshev polynomials, satisfying 

ll^llioct-i,!] = v ™,d0 = !; 



n\t-£\ 

and 



^(*)>i, |t- f |<c^Hl. 

2 n 
The constants B,C are independent of n,£,x. We define [331 P- 121] 

R ,s = Iq G Ln / A {s)V ni t.{s/a 2 Ln) L ds 
3 If G Ln/i (s)V n ^(s/a 2Ln ) L ds 

where L and r* are appropriately chosen, and £ = t jdhn- To prove this works, we split the integral 
into various pieces, consider several ranges of x, and reduce other ranges to the main range, where 
r € [S,a n ], for some fixed S. All the details appear in [33] . 

We note that for Erdos weights, or exponential weights on [—1, 1], the technical details are still 
more complicated — the subintervals Ij of the [—a n ,a n ] are no longer of equal length. See 

]• M- 



5 Weights Close to exp(— |x|) 

The weight exp(— |x|) sits "on" the boundary of the class of weights admitting a positive solution 
to Bernstein's problem. That boundary is fuzzy, but if you recall that W a (x) = exp(— \x\ a ) admits 
a positive solution iff a > 1, this makes sense. So it is not surprising that results like Jackson's 
theorem, tend to take a different form. Freud was interested in this boundary case right throughout 
his research on weighted polynomial approximation. In 1978, Freud, Giroux and Rahman 
p. 360] proved that 



EnlfrWx}! = inf ||(/ - P)W 1 \\ Lm < C 

deg(P)<n 



u (f>r—)+l \fw 1 \{x)dx 



where 



/oo pco 
KfW^x + ^-ifW^ldx + e \fWi 
-oo J — oo 



Here C is independent of / and n, and ^/n could be replaced by n 1 s for any fixed 5 S (0,1). 
Compare the t-^- inside the modulus to the n ~ 1+1 / a we obtained for W a ,a > 1. This suggests 

log 71 

that 

lim n ~ 1+1 / a = — !— , (5.1) 
a-+l+ logn 

at least in the sense of Jackson rates! Ditzian, the author, Nevai and Totik [34] later extended 
this result to a characterization in L\. The technique used by Freud, Giroux and Rahman was 
essentially an L\ technique, using the relation between one-sided weighted approximation, Gauss 
quadratures, and Christoffel functions — as we discussed in Section 4.1. 
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Only recently has it been possible to establish the analogous results in L p , p > 1 [98]. The 
author modified the spline method discussed in Section 4.4 above. The challenge is that however 
you tweak the construction of {Rj} there, it does not work. So a new idea was needed. As 
the peaking polynomials used there do not work for Wi, they were replaced by the reproducing 
kernel for orthogonal polynomials for Wf, and in the proofs, the author needed bounds for these 
orthogonal polynomials, implied by recent work of Kriecherbauer and McLaughlin |68j . 

i i 

If we examine the modulus used in (3.13) for W a ,a > 1, we see that the interval [— h 1 -" , h 1 -"] 
is no longer meaningful for a = 1. It turns out to be replaced by [— exp (^p) ,exp (^p)], for 
some fixed e € (0, 1). The modulus becomes 

The author proved |98j : 

Theorem 5.1 For < p < oo, and n > C3, 



E n [/; Wtjp < Ctu^if, W U y _ t „ _J . (5.3) 



1 

log (C 2 n) 

Here Ci, C2 are independent of f and n. 



While this may be a technical achievement, it is scarcely surprising, given that Freud, Giroux and 
Rahman already had the rate O (^j^-^j m the L\ case. What about a Jackson-Favard inequality? 
The "limit" (5.1) suggests that an analogue of (3.10) should have the form 

^ [/; ^-iof^ limil ^ R )- 

Remarkably enough this is false, and there is no Jackson-Favard inequality for W±, not even if we 
replace by a sequence decreasing arbitrarily slowly to 0. More generally, we answered in [99] 
the question: which weights admit a Jackson type theorem, of the form (3.10), with {oin/ra}^ =1 
replaced by some sequence {rj n ]'^ =l with limit 0? We proved [99] : 

Theorem 5.2 Let W : M — > (0, 00) be continuous. The following are equivalent: 

(a) There exists a sequence {rjn}^^ of positive numbers with limit and with the following 
property. For each 1 < p < 00, and for all absolutely continuous f with \\f'W\\ L m\ finite, 
we have 

inf \\(f-P)W\\ Lpm <r )n \\f'W\\ Lp(R) , n>l. (5.4) 

deg(F)<n 



(b) Both 



and 



km w(x) f w- 1 = (5.5) 

x-*oo J Q 

lim W{x)- 1 / W = 0, (5.6) 



with analogous limits as x — > — 00. 
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Two fairly direct corollaries of this are: 

Corollary 5.3 Let W : R — > (0, oo) be continuous, with W = e~® , where Q(x) is differentiable 
for large \x\, and 

lim Q (x) = co and lim Q'(x) = —oo. (5.7) 

x-^oo x—>—oo 

Then there exists a sequence {?7n}^Li of positive numbers with limit such that for each 1 < p < oo, 
and for all absolutely continuous f with \\f'W\\L (r) finite, we have (5.4). 

Corollary 5.4 Let W : R — > (0, oo) be continuous, with W = e~® , where Q(x) is differentiable for 
large \x\, and Q'(x) is bounded for large \x\. Then for both p = 1 and p = oo, there does not exist 
a sequence {i] n }'^ =1 of positive numbers with limit satisfying (5.4) for all absolutely continuous 
f with \\f'W\\ Lp(R) finite. 

In particular for W\, there is no Jackson-Favard inequality, since both (5.5) and (5.6) are false. 
Thus there is a real difference between density of weighted polynomials, and weighted Jackson- 
Favard theorems. It is possible to have the former without the latter. 

A key ingredient in the above theorem is an estimate for tails [99]: 

Theorem 5.5 Assume that W : R — > (0, oo) is continuous. 

(a) Assume W satisfies (5.5) and (5.6), with analogous limits at — oo. Then there exists a decreasing 

positive function r] : [0, oo) — ► (0, oo) with limit at oo such that for 1 < p < oo and A > 0, 

\\fW\\ Lp(M \ hx>x]) < V (X)\\f'W\\ Lpm (5.8) 

for all absolutely continuous functions f : R — > R for which /(0) =0 and the right-hand side 
is finite. 

(b) Conversely assume that (5.8) holds for p = 1 and for p = oo, for large enough A. Then the 

limits (5.5) and (5.6) in Theorem 1.1 are valid, with analogous limits at —oo. 

The above results deal with L p for all 1 < p < oo. What happens if we focus on a single L p 
space? We recently proved [lOOj : 

Theorem 5.6 Let W : R — > (0, oo) be continuous and let 1 < p < oo and ^ + - = 1. The following 
are equivalent: 

(a) There exists a sequence {rj n }'^' =1 of positive numbers with limit and with the following 
property. For all absolutely continuous f with ||/'W^||l p (r) finite, we have 

inf \\(f-P)W\\ Lpm < 7]n \\f'W\\ Lpm , n>\. (5.9) 

deg(F)<n 



(b) 

\L q [0,x] 11 \\L p [x,oo) 

with analogous limits as x — > — oo. 



J^IL^J^IL [ ,,oo) = 0, (5.10) 
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As a consequence one can construct weights that admit a Jackson theorem in L p but not in L r 
for any 1 < p, r < oo with p ^ r: 

Theorem 5.7 Let 1 < p, r < oo with p ^ r. There exists W : M — > (0, oo) such that 

— ^-s < W(x)/exp(-x 2 ) < 1 + x 2 , x£R, 
1 + x 1 

and W admits an L r Jackson theorem , but not an L p Jackson theorem. That is, there exists 
{^nl^Li w ith limit at oo satisfying (5.9) in the L r norm, but there does not exist such a sequence 
satisfying (5.9) in the L p norm. 

This is a highly unusual occurrence in weighted approximation — in fact the first occurrence of 
this phenomenon known to this author. Density of polynomials, and the degree of approximation 
is almost invariably the same for any L p space (suitably weighted of course). Recall Koosis' remark 
quoted after Theorem 1.6 [671 PP- 210-211]. 

6 Restricted Range Inequalities 

We have already seen the role played by restricted range inequalities (often called infinite-finite 
range inequalities) in weighted Jackson theorems. Paul Nevai rated Freud's discovery of these 
as one of his most important contributions to weighted approximation theory. One is reminded of 
their import when one recalls that Dzrbasjan did not have these, and could only prove estimates 
in a fixed finite interval. This should not detract from admiration for the generality of Dzrbasjan's 
results, and the sophistication of his ideas, which are still being used. 

Let's recap a little. Recall that Freud and Nevai proved that there are constants C\ and Ci 
such that for all polynomials P n of degree at most n, 

\\PnW a \\ Lp{R) < Ci\\P n W a \\ Lp [_ C:in i/ a>Cin i/ a y (6.1) 

The constants C\ and C2 can be taken independent of n, P n and even the L p parameter p £ [1, 00]. 
Outside the interval [— Cin l / a , Cira 1 /"] , P n W a decays quickly to zero. For more general W = 
exp(— Q), one replaces n 1 /", with Freud's number q n , the positive root of the equation 

n = q n Q' (q n ) ■ 

If xQ'(x) is positive, continuous, and strictly increasing in (0,oo), with limit at 0, and 00 at 00, 
then q n exists and is unique. 

Theorem 6.1 Assume that xQ'{x) is positive, continuous, and strictly increasing in (0,oo), with 
limit at 0, and 00 at 00. Then for n > 1 and polynomials P of degree < n, 



(6.2) 
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Proof. This type of proof was used many times by Freud and Nevai; see also the monograph [136, 
p. 66]. Recall that if T n is the classical Chebyshev polynomial, then for \x\ > 1 and polynomials P 
of degree < n, 

\P{x)\ < T n (\x\) ||P|| Loohlil] < (2 |z|) n \\P\\ LachlA] ■ (6.3) 
Scaling this to [—q2niQ2n], an d using the fact that Q is increasing in (0, oo) gives 

(21 I \ n ( 2l I \ n 

\\ p \\ Loo[ - q2n , q2n] <{-^) W'\ q2n )\\PW\\ Loo[ _ q2nmn] . 



So 



Now if x > 4q 2n , 



\PW\ (x) < 2" \\PW\ 



x n W(x) f x n-uQ'(u) , f Aq2n du , , 

log v = / ^^du<-n — = -nlog4, 

q2n W (<l2n) J q2n U J Q2n U 

since u > q 2 n uQ'(u) > In. Substituting in above, gives for x > 4:q 2n , 

\PW\{X) < 2~ n \\PW\\r r „ Q „ ! . 

This certainly implies (6.2). □ 

The proof clearly shows the exponential decay of PW outside the interval [— q 2n ,q2n], which 
depends only on n. In L p , any p > 0, a typical analogue is 

\\PW\\ Lpm < (l + e~ Cn ) \\PW\\ Lp[ _ Bq2ntBg2n] 

where C > 0, and B is large enough. 

These inequalities are sufficient for weighted Jackson and Bernstein theorems, but not for some 
of the deeper results such as Bernstein inequalities with endpoint effects, or convergence of or- 
thogonal expansions, or Lagrange interpolation, let alone asymptotics of orthogonal polynomials. 
With such questions in mind, Mhaskar and Saff asked in the early 1980's: where does the sup or 
L p norm of a weighted polynomial really live? Clearly it is inside the intervals [—Bq 2n , B q2n ], with 
appropriate B, but what is the best B1 In seminal papers [1381, [139] . [140] . they used potential 
theory to obtain the answer. In another seminal development, E. A. Rakhmanov a little earlier [166J 
developed the same potential theory in order to investigate asymptotics of orthogonal polynomials. 

One of the best ways to understand their work is to recall Bernstein's bound for growth of 
polynomials in the complex plane. For n > 1, polynomials P of degree < n, and z ^ [—1, 1], 



\P{z)\ < z+V 



KlLoof-Ul 



Thus once we have a bound on P on an interval, we can estimate its growth outside. Of course, it 
is related to the inequality involving T n that we used above, but that works only for real x. 

Now let us look for a weighted analogue for even weights W = exp(— Q), where, say, x i— > xQ'(x) 
is increasing, positive, and continuous in (0,oo). Suppose that a > 0, and we have a function 
G = G nta with the following properties: 
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(I) G is analytic and non-vanishing in C\[— a, a]; 

(II) G{z)z n has a finite limit as \z\ —> oo; 

(III) \G{z)\ -► W(x) asz->x£ (-a, a). 

Then given a polynomial P of degree < n, the function PG is analytic in C\[— a, a], including 
at oo, where it has a finite limit. Moreover, for x £ (—a, a) , 

lim \PG\{z) = \PW\(x) < \\PW\\ Loc[ _ aM . 
By the maximum modulus principle, we then obtain 

\PG\{z)<\\PW\\ Lao[ _ aM , z£[-a,a]. 
In particular, for real x with |x| > a, 

\PW\(x) < \\PW\\ Locha , a] W(x)/G n , a (x). 
For a given n, Mhaskar and Saff found the smallest a = a n for which 

W(x)/G nta (x) < 1 for all |x| > a. (6.4) 

Clearly, we then have 

\PW\(x)<\\PW\\ Loo[ _ an , an] , \x\>a n , 
and the Mhaskar-Saff identity 

\\ p W\\ Loo (R) = \\ p W\\ Loc [-a n ,a n }- 

Recall that a n is called the Mhaskar-Rakhmanov-Saff number, and is the positive root of the 
equation 

2 f 1 a n tQ' (a n t) 



n = - " ^ v " ' dt. (6.5) 
vr7 



For the weight W a {x) = exp(— |x| a ),a > 0, 



- 2 £Wg)! \ 1/a n i/ a 



n \ r(«) j 

How does this arise and how does one find the function G n ,at We solve the integral equation 



L 



Qiax) 

log \x - t\ Hn, a {t)dt = h c n , a , x £ [-1,1], (6.6) 



for the function /x ni<1 , and some constant c nA , subject to the condition 



a 

l^n,a = 1- 

-a 
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By (formal) differentiation, we see that this is really equivalent to solving a singular integral equation 

1 Mn,a(X) aQ'(ax) 

— ■ at = , x G (—a, a) . 

1 x — t n 

The integral on the left is a Cauchy principal value integral. There is a well developed theory of 
such equations [H], ffSZ]- One can show that [mi P- 37], [HU p. 124 ff.] 

2 f 1 sf\ - x 2 asQ'{as) - axQ'(ax) B n a 

71-2 7o Vl - s 2 n{s 2 -x 2 ) vrVl - x 2 

where 

Then the function G nja is given by 

G n ,a{z) = exp ^-n J log ^ - Hn,a(t)dt + nc n , a 

so that for x £ [—a, a] , 

\G n , a (x)\ = exp ^-n J log ^ - f n na {t)dt + nc„ ia ^ = W(a:), 

by (6.6). So at least we have verified (III) above. It turns out that whenever B n ^ a > 0, then 
W/G n a > 1 in some right neighborhood of 1. Thus we look for the smallest a for which B n a — 0. 
We see from (6.8) that this requires 

2 f 1 atQ'jat) 

1 / — at = ==> a = a n . 

nir J Vl - t 2 

This scratches the surface of an extensive theory. The monograph of Saff and Totik [167J 
contains a detailed and deep development. A more elementary treatment is provided in [136J, 
though that will be more than sufficient for those interested only in the topics of this survey. For 
general exponential weights, restricted range inequalities in quite precise form are investigated in 
[85]. Here is a typical result [85, Theorem 1.8, p. 15], [Ml Thm. 6.2.4, p. 142]: 

Theorem 6.2 Let Q : R — * [0, oo) be even and convex, with limit oo at oo and 

= Q(0) < Q(x), x^O. 

(a) For not identically zero polynomials P of degree < n, 



and 



I 1 1 flR) II [_ a n ,a n ] 
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(b) Let < p < co and P be a not identically zero polynomial of degree < n — |. Then 



and 



\pw\\ Lp(R) < \\PW\\ Lp[ _ an7an] 



\ PW \\L p (R\[-a n ,a n ]) < \\ PW \\L p [-a n ,a„] 



Quite often, we need a smaller estimate for the tail, and this is possible provided we omit a 
little more than the Mhaskar-Saff interval. Here is what can be achieved |85|, Thm. 1.9, p. 15] for 
the class of weights J-* , specified in Definition 4.6: 

Theorem 6.3 Let W S T* and < p < oo. For n > 1, k G (0, 1] and polynomials P of degree 
< n, 

W PW \\L p (R\[~a n (i + ,),a n (i+K)]) < Ci exp(-nC 2 K 3 / 2 ) \\PW\\ Lp[ __ antan] . 
Here C\ and C2 are independent of n, P, k. 

More generally, it was proved there 

Theorem 6.4 Let W = e~ Q , where Q : R [0, 00) is even and convex, with Q having limit 00 at 
00, and 

T(t) = *2M 
( j Q(t) 

is quasi-increasing in the sense of (3.27) of Definition 3.11. Assume furthermore that for some 
A > 1, T > A in (0, 00), while 



T(y) ~ T [y 



1 1 



y£ (0,oo). (6.9) 



Then for n > 1, k G (0, T ^ r ] and polynomials P of degree < n, 

\\PW\\L pm -a n (l +K ),a n{ i +K ) ] ) < C ie xp(-C 2 nT(a n ) K 3 / 2 ) ||P^|| ip[ _ 0n)0ri] . (6.10) 
Here C\ and C2 are independent of n, P, k. 

Finally, we can go back a little inside the Mhaskar-Saff interval if we allow a cruder estimate 
on the tail [851 Theorem 4.2(a), p. 96]: 

Theorem 6.5 Assume in addition to the hypotheses of Theorem 6.4 that Q" exists and is positive 
in (0, 00), while for some C > 0, and large enough x, 

Q"(x) < Q'(x) 
Q'(x) ~ Q(x)' 

Let < p < 00 and A > 0. Then there exists uq and C such that for n > uq and polynomials P of 
degree < n, 

\\ PW \\l p (R) ^ C ll P ^ll[-an(l-Ar?„),a„(l-Aj7™)] ' ( 6 - U "> 

where 

Tjn = (nT (a n ))- 2/3 . (6.12) 
See [57], [61], [85], [134] , |136j . |145j . [167] for further discussion of restricted range inequalities. 
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7 Markov— Bernstein Inequalities 

Markov-Bernstein inequalities have already been mentioned above. They are the main ingredient 
of converse theorems of approximation, but also enter in many other contexts. For the unweighted 
case on [—1,1], the classical Markov inequality asserts that 

ll^1lw-i,i]^ n2 ll p llw-i,ip ^ 

for n > 1 and all polynomials P of degree < n. The Bernstein inequality improves this as long as 
we stay away from the endpoints: 

71 

l p/(x) l - TrT^f ii p n w-M . x G !) > ( 7 - 2 ) 

for n > 1 and all polynomials P of degree < n. 

The weights W a (x) = exp(— |x| a ),a > 0, already provide a lot of insight into the weighted case 
on the real line. The analogue of the Markov inequality is 

{n 1 " 1 /", a > 1 
log(n + l), a = l . (7.3) 
1, a < 1 



This is valid for all n > 1 and polynomials P n of degree < n. The constant C depends on p £ (0, oo] 
and a, but not on n or P. It's no accident that the factors arising are similar to those in the Jackson 
rates, and for a < 1, the factor is bounded independent of n. For a > 2, these inequalities were 
proved by Freud; for 1 < a < 2, by Eli Levin and the author; and for a < 1, by Paul Nevai and 
Vili Totik. 

In order to generalize this for arbitrary Q, we should try to cast this estimate in a unified form. 
For a > 1, we see that 



n 



1-1/ a 



it 



C 1 — = C 2 Q' (a n ) 



where Q(x) = and recall, a n = C\n l l a . As we shall see, Q' (a n ) is the correct factor whenever 
W = exp(-Q) and Q is even and grows faster than \x\ a for some a > 1. For Freud weights, where 
Q is of polynomial growth, but still grows faster than \x\ a , for some a > 1, we can use — . However, 
for all polynomial rates of growth, including |x| a ,a < 1, we can use 



On 



ds 



where if Q is strictly increasing and continuous on [0, oo), Q' -1 ' denotes its inverse. The most 
general result for Q of polynomial growth, with this factor, is due to Kroo and Szabados \69\ 
p. 48]: 

Theorem 7.1 Let W = exp(-Q), where Q is even, continuous, increasing in (0, oo), and twice 
diSerentiable for large enough x, with 



(7.4) 
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and 

v X Q"( X ) „ 

limsup < oo. (7.5) 

x^oo Ctf \X ) 



Then for n > 1 and al/ polynomials P of degree < n, 

f Cn ds 

fTereC/C(n,P). 

The author and Eli Levin used the slightly more restrictive condition that W £ J 7 *, so (4.18) 
holds. This implies, like (7.4) and (7.5), that Q grows faster than some positive power of x, but 
slower than some other positive power of x. Strangely enough, there does not seem to be a published 
form of this result in L p . Indeed, extra difficulties arise in L p — we shall discuss techniques for 
proving Markov-Bernstein inequalities in the next section. However, there are results that apply 
to L p separately for the case where Q grows faster, or slower, than \x\. First, we record the former 
case [84, p. 231]: 

Theorem 7.2 Let W £ J-* . Let 1 < p < oo. Then for n > 1 and all polynomials P of degree < n, 

\\P'W\\ T (m < C— \\PW\\j m . 

II llLp(R) - Qn II HLp(M) 

HereC^ C(n,P). 

This was later extended to the case < p < 1 |85l Corollary 1.16, p. 21]. For weights growing 
slower than Nevai and Totik |151[ Thm. 2, p. 122] used a beautiful method to prove: 

Theorem 7.3 Let W = exp(— Q), where Q is even, increasing, and concave in (0, oo), with 

Q(x) 

T;dx < OO. 

1 + x z 

Then for n > 1 and all polynomials P of degree < n, 



o 



\ P 'WL ( m<C\\PW\\ Lo 



Here C ^ C (n, P). If in addition, 



,. 2Q{x)-Q(2x) 

hm = oo, 

x^co log X 



then, given p > 0, we have for n > 1 and all polynomials P of degree < n, 

\\ P 'W\\L pm ^ C \\ p W\\ Lpm - 

HereC ^C(n,P). 
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The above are all Markov inequalities, analogues of (7.3). There are also Bernstein inequalities, 
which reflect the opposite feature of the finite interval case. The growth in n decreases towards the 
end of the Mhaskar-Rakhmanov-Saff interval. To state these, we need the function 

4> n (x) := maxil - — ,n~ 2 / 3 l , i£i 
I a n J 

Eli Levin and the author proved [841 Thm. 1.1, p. 231]: 

Theorem 7.4 Let W £ J-* and let 1 < p < oo. Then for n > 1 and all polynomials P of degree 
< n, 



HW^ 1/2 ||L p (R)<Cf \\PW\\ LpW . 



HereC ^C(n,P). 



For p = oo, this was proved in [82], and for 1 < p < oo, in [84] . If p = oo, we see that this has 
the consequence 

n 2/3 

\(PW) f (a n )\<C— \\PW\\ Laom , 

when P has degree < n. There is in [841 Thm. 1.2, p. 233] an analogue of this for weights satisfying 
(4.18) in Definition 4.6 with A > only (such as exp(— \x\ a ) , a < 1). One fixes t] S (0,1), and 
proves that 

|| (P^)Vn V2 IU P (I*I>^) < \\PW\\ Lpm ■ 

Note that in the Bernstein inequalities, it is essential that we estimate (PW)' , and not P'W. There 
is no improvement for the latter, at least in general, near ±a n . 

Analogues of Markov and Bernstein inequalities have also been obtained for Erdos weights, 
where Q is of faster than polynomial growth, as well as for exponential weights on (—1, 1). They 
are also available for non-even exponential weights on a possibly asymmetric finite or infinite interval 
[85] . For simplicity, we quote only the even case. First, we define a suitable class of weights, which 
includes both the Freud and Erdos weights in the real line, and exponential weights on (—1,1): 

Definition 7.5 Let I = (—d,d) where < d < oo. Let Q : I — > [0, oo) be an even function with 
the following properties: 

(a) Q' is continuous and positive in I and Q(0) = 0; 

(b) Q" exists and is positive in (0, d) ; 
(c) 

lim Q(t) = oo; 

t — 

(d) The function 

Tit) = 

is quasi-increasing in (0, d), in the sense that 

0<x <y <d T(x) < CT(y). 
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Moreover, for some A > 1, 



(e) There exists C\ > such that 



T(t) >A>1, t€(0,d). 



s 6 (0, d) . 



Q'(x) ~ ~ x Q(x) ' 
Then we write W £ Teven (C 2 ) ■ ^ aiso, there exists c G (0, <i) such that 

x e (c, d) , 



|Q"(s)| ^ c Q'(x) 



Q'(x) -~*Q{x)' 
then we write W £ -T^en (C 2 +). 

Examples of weights satisfying this are W a ,a > 1, as well as W = exp(— Q), where 



Q(x) = exp e (\x\ a ) — exp^(0), x G (— oo, oo) , 



(7.6) 



with £ > 1, a > 1, and 



exp^ := exp((- • -exp())) 
denoting the ith. iterated exponential. We set exp (x) := x. Others include 

Q(x) = exp^(l - x 2 ) j - exp^(l), are (-1,1). 

where now t > and a > 0. 

Theorem 7.6 Let < p < oo and W = e~ Q £ T even (C 2 ) . For n> 1, let 



(7.7) 



1 - 



Q2n 



|X| 



+ (nT (a„)) 



-2/3 



(7.8) 



Then for n > 1 and polynomials P n of degree < n, 



\(PW)'<p n \ 



< C\\PW\ 



Moreover, 



\P'W\\ T , K .<C-T{a n f/ 2 \\PW 



This result is a special case of Theorem 1.15 and Corollary 1.16 in [851 p. 21]. There the 
restrictions on Q are weaker, but the definition of the classes is more implicit, so we restrict 
ourselves to the smallest even case considered there. The main feature is the extra factor T{a n ) 1 / 2 , 
which really is there — weighted extremal polynomials attain it, see 

As examples, Q of (7.6) has Markov factor [851 p. 30] 



1/2 



T(a n ) 



1/2 



n 



(logi n) 
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Here 

log j := log (log (• • • log ())) 
denotes the jth iterated logarithm. For Q of (7.7) and I = 0, the Markov factor is [851 P- 32] 



while if £ > 1, it is [851 P- 34] 

-TK) 1 / 2 ~ n(log«n)5( 1+ -) ( Y[lo gj 



n 
a r , 



'£-1 

_ a 



8 Methods to Prove Markov— Bernstein Inequalities 

In this section, we outline some of the methods that have been used to prove Markov-Bernstein 
inequalities for exponential weights. An extensive treatment of Markov-Bernstein inequalities for 
both weighted and unweighted cases is given in [9], [143]. We begin with 

8.1 Freud's Method via de la Vallee Poussin Means 

Recall that in Lemma 4.5 we proved 

\\Vn[f)W\\ Loo{R) <C\\fW\\ Loom , 
where C ^ C (re, /). Thereafter, at (4.17), we showed how duality can be used to prove 

\\Vn[f]W\\ Li{R) <C\\fW\\ Lim , 

where C 7^ C (n, /). In much the same way, one can prove that when W G J 7 *, 

\\VMW\\ Laa( ^<C-\\fW\\ Loo(m . (8.1) 
The main new technical ingredient required is the estimate 

n-l 



k=0 



This is a cousin of the Christoffel function estimate (4.13) and can be proved using much the same 
ideas. Freud did this in [42] . A proof is also given in [1361 p. 64]. Now if P is a polynomial of 
degree < n, then 

Vn [P] = P- 

So (8.1) immediately gives the Markov-Bernstein inequality 
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II^WIIl.cr) < c- \\PW\\ Loo(m . 
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To extend this to Li, we use duality on V^: 

ll^[/]W r |L 1 ( R ) = sup / V^[f]gW 2 



(8.3) 



where the sup is taken over all measurable functions g with HgWH^^ < 1. Next, we use the fact 
that if P is a polynomial of degree < 2n, then 



/oo 
(9-V 4n [g])PW 2 = 0. 
-oo 



Indeed, we can prove this by considering the special case P = pj, < j < 2n, and recalling that 
the first In Fourier series coefficients (with respect to {pj}) of V^ n [g] are the same as those of g. 
The above considerations and an integration by parts give 



/oo 
K\f\ 
-oo 



gW 2 = 



VMV An [g] W 2 
V n [/] Vi n [g] W 2 + 2 V n [/] V An [g] Q'W 2 

oo J —oo 



(8.4) 



Here 



Vn[f]Vi n [g] W' 



< \\VLl9]W\ 



/oo 
\Vn[f}\ 
-oo 



w 



n 



n 



< c- \\gW\\ L , R) \\fw\\ Li{R) < c- \\fw\\ Li 



(8.5) 



In the second last line, we used our L\ bound (4.17) for V n , and the bound (8.1) for V^, and in 
the last line we used our bound on the sup norm of gW . To bound the second term in (8.4), one 
needs 



Lemma 8.1 Let h be absolutely continuous with h(0) = 0. Then 



II^IL m <^II^IL 



(8.6) 



Proof. Observe that if x > 0, 

\Q'(x)h(x)W{x)\ 



= Q'{x)W{x) / h'{t)dt 
' \\h'W\\ T 



Q'{x)W{x) / W^{t)dt 
Jo 



(8.7) 



We now assume that Q' is increasing, and 



hSl = 0. 



A"oo Q'(t) 2 

This latter condition is true for regularly behaved Freud and Erdos weights such as T*,£. Choose 
A > such that 

Q"{t) . l 



< 



Q'itf ~ 2' 



t > A. 
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Then 



and 



If x > A, an integration by parts gives 

/ W~\t)dt = / Q'(t)~ 1 Q'(t)W~ 1 (t)dt 

J A J A 

< Q'{x)~ 1 W~ 1 {x) + - / W-\t)dt. 

2 J A 

J w-\t) dt < 2Q'( X y 1 W- 1 (x) 

x>A^ Q'(x)W{x) I W- X {t) dt < 2. 
J A 

As Q'W is bounded in (0, oo), we obtain 

x > Q'(x)W(x) / W _1 (i) dt < C. 
Jo 

A similar bound holds for x < 0, and then we obtain (8.6) from (8.7). □ 

Next, we apply (8.6) to the second term in the right-hand side of (8.4), with h = V 4ri 
V in [g] (0). We obtain 

\V 4n [g] Q'W\ (x) < \V 4n [g] (x) - V 4n [g] (0)| \Q'(x)\ W(x) + V 4n [g] (0) \Q'(x)\ W(x) 
< C\\VL [g] W\\ Loo{R) + C\V 4n [g] (0)| < C— \\gW\\ Lao(m < C-, 



so 



V n [f] W 



V n [f] V 4n [g] Q'W 2 
Combining (8.4), (8.5), and (8.8) gives 

71 

\\v r [[f}w\\ Lm <c-\\fw\\ Lim 

Thus, recalling (8.1), we have that for both p = 1 and p = oo, 

\K[f}w\\ Lpm <c-\\fw\\ Lpm . 

By interpolation, we obtain 

\K[f}w\\ Lpm <c-\\fw\\ Lpm , 



< c- \\fw\\ Li{m) . 
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for all 1 < p < oo, where C ^ C (n,P). Applying this with / = P, a polynomial of degree < n, 
gives 

Tl 

\\P'W\\ Lp{R) <C—\\PW\\ Lpm . 

This method is elegant, and yields more than just Markov-Bernstein inequalities. As we have 
seen, we also obtain a proof of a Jackson-Favard type inequality, bounds on Cesaro means of 
orthogonal expansions, and other useful information. Our main technical ingredients were bounds 
on Christoffel functions and their derivative analogue (8.2), and some technical estimates involving 

Qf. 



8.2 Replacing the Weight by a Polynomial 



This method is very simple, but applies only to a limited class of weights. Suppose that for some 
fixed K > 0, we have polynomials S n of degree < Kn, with 



Ci < S n /W < C 2 in [-2a n ,2a r , 



and 



Tl 

S' n \/W<C 3 — in \-a n ,a n ). 



(8.9) 



U0) 



These polynomials enable us to reduce weighted Bernstein inequalities to classical unweighted 
Bernstein inequalities. If P is a polynomial of degree < n, then our restricted range inequalities 
give 



\P'W\ 



< C\\P'W\\ T . . < CC7 1 \\P'S n \\ T r . 

— II \\L v [—an,an\ — i II "IIL P [— a„,a n \ 



< CC 1 J] (PS n ) llx.jjj— a „ )0n ] lr ^n||£, p [_ anj0n ] 



< cc; 



2n „ 

— PSn 



n 

L p [-2a n ,2a n ] + ~~ II "^11 



by our hypotheses on S' n , and the classical Bernstein inequality, scaled from [—1, 1] to [— 2a n , 2a n ]. 
Using (8.9) again, we obtain 



\P'W\ 



< C\\PW\ 



This works in any L p , < p < oo. If p = oo, we can weaken the requirements on S n , which can be 
made different for each x. We only need the upper bound on S' n at a given x, and the lower bound 
on S n at that x (but we still need the upper bound on S n throughout [— 2o n , 2a n ]). 

Freud and Nevai used this for weights like W2 m ( x ) = exp(— x 2m ), where m > 1 is a positive 
integer. The partial sums of these entire weights can be used for {S n }. For W a ,a > 1, Eli Levin 
and the author used canonical products of Weierstrass primary factors, such as 



oo 



n=l 



to generate these polynomials [80], [SI]. This method can provide quick easy proofs in special cases, 
which would be useful for teaching a course on weighted approximation. 
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8.3 Nevai-Totik's Method 



This involves fast decreasing polynomials, a topic initiated by Kamen Ivanov, Paul Nevai and Vili 
Totik, and works well for slowly decreasing weights such as W a ,a < 1. Let us suppose that for 
n > 1, and some K > 0, we have polynomials Sn that satisfy 



and 



S#(0) = 1, Sf (0) = 0, 



S#(x) <Ke~ Q ( x \ xe[-a n ,a n ] 



ill 



1.12) 



Typically, SJl has a unique maximum in [—1,1] at 0, and decreases rapidly in (0,1). Nevai and 
Totik |151j used a construction of Marchenko to generate such polynomials. One starts with an 
entire function of the form 



B(z) 



n 

fc>i 



1 + 



t k 



The product may be finite or infinite, and the are positive numbers with 

T = Vi<oo. 

Assuming that Q is even, increasing on (0, oo), and 

Q(x 



1 + x'' 



dx < oo, 



1.13) 



one can choose {t^} such that T < 1/n and 

\B(x)\ < Kexp(-Q(x)), x£R. 

In this case B is an entire function of exponential type < 1. Assuming (8.13), Nevai and Totik used 
the partial sums of B to construct polynomials P n of degree < n, with a local maximum at 0, and 



P„(0) = 1 and \P n (x)\ < K exp(-Q(nx)), x G [-1, 1] . 



U4) 



Then 



S*{x) = P K 



[On] 



satisfies (8.11) and (8.12), and is of degree < [a n ]. It is easy to derive the Markov inequality at 0: 



(P'W)(0)\ = |P'(0)| = (PSjf)'(O) 



ra+ LOnJ ||po#i| 

— „ IK ?! 1 1 -Loo [-In, In] 



by the usual unweighted Bernstein inequality. We continue this as 

\(P'W)(0)\<C\\PW\\ L , an] , 



1.15) 
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using the property (8.12), and the fact that the convergence (8.13) implies n = 0(a n ), which we 
shall not prove. To extend this to general x > 0, we use the evenness and concavity of Q, which 
implies that for x, y G R, 

W(x)W(y) < W(x + y). 
Now apply (8.15) to the polynomial R(y) = P{x + y)W(x), for fixed x: 

\(P'W)(x)\ = \R'(0)W(0)\<C\\RW\\ Loo(R) =C S up\P(x + y)W(x)W(y)\ 
< C sup \P(x + y)W(x + y)\ = C ||PW|| Loo(R) . 

y&R °° V ' 

This method can be modified to give the correct Markov inequality for W\. Nevai and Totik were 
the first to do so. 



8.4 Dzrbasjan/Kroo— Szabados' Method 



Dzrbasjan was apparently the first researcher to investigate the degree of approximation for general 
exponential weights, in his 1955 paper [37]. Although his approximation estimates worked only on 
a finite interval, he nevertheless came up with a great many ideas. Kroo and Szabados subsequently 
used Dzrbasjan's method, and restricted range inequalities to establish Theorem 7.1. 

We start with Cauchy's integral formula for derivatives (or, if you prefer, Cauchy's estimates): 



\P'(x) 



J_ r P(t) 
2iri J\ t - X \=e (t ~ x)'- 



-dt 



< -sup{|P(t)| :\t-x\=e} 



The number 1/e is invariably chosen as the size of the Markov-Bernstein factor. To estimate P(t) 
in terms of the values of PW on the real line, we write t = u + iv, and use an inequality that often 
arises in the theory of functions analytic in the upper half-plane. We already used one form of this 
in the proof of Lemma 2.2. 



/C 



log|P(s) 



\v, 

;ds < — 



log M + Q(s) 



logM + 



(s — u) 2 + V 2 TT J_ 00 (s — U) 2 + V 2 

Q(s) 



ds 



TT 



(s — u) 2 + v' 



rds, 



U6) 



where 



M := \\PW\ 



Of course, we need to assume the integral converges, which is very restrictive. We shall assume 
more precisely that 

Q(t) 



l + t 2 



;dt < CO, 



and that for s > 1, 



Q(s) ~ sQ'(s) ~ s 2 Q"{s). 
Then the above considerations show that 



1 If I 

log \P'W\ (x) < log - + logM + sup — 

e v tt 



Q(s)-Q(x) 
(s — u) 2 + V' 



ds. 



3.17) 
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The technical challenge is to estimate the integral in the last right-hand side. We break it into 
several pieces. If we assume x > 1, < u < 2x and Q(x) > 0, we see that 

M /- «.)-«.) ./-gw^ 

and 

|«| T Q(s)-Q{x) J e f° Q(s 



h-= — / t ^— — nds < 9- / — ^-ds. 

(Recall that \v\ < \t — x\ < e.) In [0, |] , we see that the integrand is non-positive, as s < u/2 < 



so 



v 



Finally, we handle the difficult central integral 

u = ^ r 12 q[s) ~ <*« 

M r 3 " 72 Q(s)-Q(u) , - , > ^. . 
=: /41 + -^42- 



Here 



|w| f 3u/2 Q(s) - Q(u) , \v\ f u Q{r) - 2Q(u) + Q(2u - r) , 
hl = ~L {s-u? +V > dS =-j u/2 { r-u ?+ v> ^ 



For some £ between r and u, 

Q(r) - 2Q(u) + Q(2u - r) = Q"(£)(r - u) 2 
and one can show that [691 P- 53] 

Q"(0 < cQ&, 

so 

r 2u 



e_ r u Q(r) 
* L/2 r 2 



hi < C- j ^dr. 



Also, 

I42 = \Q(u) - Q(x)\ = Q'(£) \u-x\< CQ\x)e < Ce [ ^fids. 

Jx S 2 

Combining the above estimates, we have shown that for x > 1, 

1 r°° O(v) 

log \P'W\ (x) < log - + logM + Ce / -P^dy. 

e A r + i 
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We simply choose e = 1. Similar estimates hold over (— oo,— 1], and the range [—1,1] is easy to 
handle. This leads to the Markov inequality 

ll^l| ioo(R )<C||PW|| Zoo(R) , 

valid for all n > 1 and polynomials of degree < n. 

When dealing with weights W = exp(— Q), where Q grows faster than \x\ a , some a > 1, one 
typically chooses 

a n 

£=—, 
n 

uses restricted range inequalities, and needs more work to estimate the various integrals. 



8.5 Levin— Lubinsky's Method 

Like the previous method, we use Cauchy's integral formula for derivatives, and then go back to 
the real line. However, instead of using (8.16), we use the potential theoretic functions associated 
with exponential weights. These give analogues of the Bernstein-Walsh inequality for growth of 
polynomials in the complex plane. For p < oo, going back from the plane to the real line is quite 
complicated, and requires Carleson measures. We shall outline the method for 1 < p < oo. 
Fix x > 0, e > 0, and define an entire function F x by 



Observe that 
We have 

\(PW)'{x)\ 



F x (z) := exp(-Q(x) - Q'(x)(z - x)) 
F£)(x) = W ij) (x) for j = 0,l. 

(PF x )(t) 



\{PF x )'{x) 



1 

2vri 



\t-x\=e {t-x)'- 
l/p 



-dt 



< 



-/ \PF x \(tf \dt\) (± [ \t-x\ 

^J\t-x\=e J \ 27T J\t-x\=e 



1/9 



-2q 



1 / 1 



\dt\ 



£ V27T 



\ Vp 

\PF x \ p (x + £e l9 )d8) . 



1.18) 



In the second last line, we used Holder's inequality, and q = there. We shall choose 

£ = £(n,x) = ip n (x), 

where (p n (x) is given by (7.8). This guarantees |85l Lemma 10.6, p. 301] that 

\F x {z)\ <CW(\z\) for \z-x\<ip n (x). (8.19) 

The proof of this involves a Taylor series expansion of Q(Re z) about Q(x). Now comes the potential 
theory bit. There is |85[ Lemma 10.7, p. 303] a function G n analytic in C\ [— a n , a n ], with boundary 
values G n (x) on [—a n ,a n ] from the upper half-plane satisfying 



\G n (x)\ = W(x) 



x e [-a n ,a n ] . 
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Moreover, z n G n (z) has a finite limit at oo, and uniformly for x £ [—a2n, a,2n] , 

W(\z\)<C\G n (z)\, \z - x\ < <p n (x). 
One representation for G n is 

G n (z) = exp 



1.20) 



\og(z - t)dn n (t) - C n , 



where fi n is the equilibrium measure of mass n for the weight W, and c n is an equilibrium constant. 
Recall that we used something similar in deriving restricted range inequalities, in Section 6. Then 
we obtain from (8.18) to (8.20), 

<ft(x)\(PW)'(x)\ p <C r \PG n \P(x + ip n (x)e i9 )de. 

J —IT 



Integrating gives 



/jwr < c 



-ain 



\PG n \ p (x + ^n(x)e ie )d9 



dx 



C J \PG n \ p d[u+ + u~], 



(8.21) 



where v£ is a measure on the upper half-plane, and v n is a measure on the lower half-plane. For 
Borel measurable sets S, with characteristic function xs, we have 



/<J2n [ fit 
/ xs(x + Mxy e )d6 dx- 
-Cl2n \-J0 

/ Xs(x + Vn{x)e ie )d9 dx. 

-0,2n \-J — 7T 



Next, we use a famous inequality of Carleson. We say a measure v on the upper half-plane is a 
Carleson measure if there exists A > such that for all squares K in the upper half-plane, with 
base on the real axis, and side h > 0, 

v{K) < Ah. 

The smallest such number A is called N (i/), the Carleson norm of v. Let H p denote the Hardy 
space in the upper half-plane, consisting of all functions analytic there, whose boundary values on 
the real axis lie in L p (M). For / G H p , 



\f\ p du < CN{v) 



(For p < 1, there is an analogous statement.) 
Applying this to (8.21) gives 



V* | (PW)'\ P <C (AT [„+]+ N [„"]) / |PG n r. 



-a2n 
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Here although v n is a measure on the lower half-plane, it is obvious what is meant by its Carleson 
norm. Next, one shows that 

sup (N [u+]+N[u~]) <oc. 

n 

Finally, 

pa,n ra n 

\PG n \ p = / \PW\ P , 



and one can show using Hilbert transform estimates that 

r ra n 

/ \PG n \ p <C \PW\ P , 

J\x\>a n J —a n 

giving 



/<J2n ra n 
Vn\{PW)'\ P <C \PW\ P . 
-ao.n J —a n 



The integral over R\ [— a2 n ,02n] rnay be handled using restricted-range inequalities. 

This method is the deepest of those we presented — and it is the only one that gives the 
Bernstein inequalities in Theorem 7.4 and 7.6. For p = oo, the proof is easier, as one can avoid 
Carleson measures [82], [133] . 

8.6 Sieved Markov— Bernstein Inequalities 

The methods of this section have been extensively developed by Paul Nevai and others [105] . [108J, 
and illustrate the power of Jensen's inequality. On finite intervals, they go back at least to Zygmund 
[193J. The idea is to start with the Christoffel type estimates 



77 f°° 

(PW) 2 (0 < C— (PW) 2 , (8.22) 

J— oo 



and 

3 



(P'W) 2 (0 < c(-) r (PW) 2 , (8.23) 

\ On / J — oo 

valid for Freud weights W, n > 1, polynomials P of degree < n, and £ G R. We used these already 
in Section 8.1. 

We now extend these to L p type Christoffel function estimates. Let < p < 2. From (8.22), we 
derive 



and hence 

W p W\\ P Locm <C^ T \PWf. (8.24) 

a n J — oo 

Next, (8.23) followed by (8.24) give 

3 



\r'nLm < c(f ) WPwf-^J \pw\ p 



- c [-\'(- r\pw\ p \ 7 r\pw\ p . 



Q>n J V^n. J —oo / J —oo 
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Rearranging gives 



P+l fOO 

\P'n P T,„<m<C - / |FWf. (8.25) 



a, 



CO 



Thus far we have (8.24) and (8.25) for < p < 2. Our weight is W = exp(— Q). We apply these 
inequalities instead to the weight exp(— rQ) for fixed r > 0. Its Mhaskar-Rakhmanov-Saff number 
is a multiple of that for W = exp(— Q). Since for any fixed s > 0, 

a sn ~ a n uniformly in n, 

we obtain 

II PWI, 
\L 

and 



n r°° 

PWY,,„<C- \PW*\ P . (8.26) 



W^WL (m <C{^) JJPW1', (8.27) 

for all < p < 2, r > 0, rt > 1, and P of degree < n. 

Now comes the sieving idea. Let L, M be positive integers and fix We apply these inequalities 
to the polynomial in t, 

P(t) = S(t)(K M n(Z,t)) L , (8.28) 

where S has degree < n, and 

n-l 
3=0 

is the nth reproducing kernel for the weight W 2 . As 

(LM + l)n n 

r^i j 

a (LM+l)n «n 

uniformly in n, we obtain from (8.26), 

{|5(C)Kf fn (e,0|^ r (0} P <C- / |(5W r )(t)^ n (e,t) i | P cft. 
Here, for Freud weights W <E J 7 * [83] . 

= l/An(^ 2 ,6 ~ -W~ 2 (0, lei < Un, (8-29) 

while 

K n {£, = l/X n (W 2 , < C— W~ 2 (0, £ G K, (8.30) 

so if M is so large that 

-iMn > 2a n , 

we obtain for all polynomials S of degree < n, and all |£| < 2a n , 

/ \ l-ip roc 

\S(Z)\PW<r- 2L >(0 < C — / |(STn(i)ZW£,0T^. 



P+l POO 
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Now assume that L is chosen so large that Lp > 2. By Cauchy-Schwarz, for all n and t, £ G R, 

Lp-2 



|^Mn(^,t)| L ^ 2 < (^Mn(e,6 1/2 ^Mn(t,t) 1/2 ) J 

/ \ Lp-2 

< C( — ) (W'^OW' 1 ^))^- 2 . (8.31) 



So 



\S(Z)\ p W( r -^ p - 2 (0 < C— / |5(t)| p K M n(e,i) 2 ^ (r " L)p+2 (i)^- (8-32) 
We now choose r = L + 1, giving 

i(w)(or<c^ / i(w)(t)r[^(o^(t)K M n(e,t)] 2 ^, (8.33) 

valid for all < p < 2, n > 1, S of degree < n, and |£| < 2a n . Since (by orthonormality) 

/oo „ 
K 2 Mn {i,t)W{t) 2 dt = K Mn (tO ~ — T^- 2 (0, (8-34) 
-oo a « 

we can also write this as: 

Lemma 8.2 Let < p < 2, n > 1, S of degree < n, and |£| < 2a n . Then 

f°° |(SlF)(t)|^ M n(£,t) 2 VF 2 (t)dt 

From this follows: 

Lemma 8.3 (Fundamental lemma of sieving) Let ^ : [0, oo) — > [0, oo) be a convex increasing 
function of x, with ip(0) = 0. Let p > 0, n > 1, S 1 of degree < n, and |£| < 2a ra . Then 

(°° ^(C\SW\P){t)K M M,t) 2 W 2 (t)dt 

If in addition, for some A > 0, 

^(2t)<A0(t), t€[0,oo), 

then 

f°° V(l^l P )(i)-^Mn(e,*) 2 ^ 2 (*)^ 

*«™^° • (8 ' 37> 

Proof. For p < 2, this follows from (8.35) by a single application of Jensen's inequality. For 
general p, one instead applies Jensen's inequality with the convex function t i— ► z/> (t a ), with large 
enough a. □ 
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Next, we extend this to derivatives. We again use the polynomial P of (8.28). We let 

K' n (x,t) = ^K n (x,t). 

By (8.25), 

S'iOKMnit, + L (K Mn (£, Of' 1 K' Mn {£, 0S(0 P W r ?(0 



< C - / \(SW r )(t)KMn(Z,t) L \ P dt. 

\ a n J J-oo 



Here we recall (8.29) and (8.30), while Cauchy-Schwarz and (8.2) give for all x,t G R, 

(Mn-l \ X / 2 /Mn-l \ X / 2 . ^2 

E *4(o 2 E p*w 2 <^ (f ) ^(o^w- 
fc=o / \ fc=o ' V «/ 



Then 



|s'(o| p w(o (r " i)p_2 < c(— V i5(or^ (r_i)p ~ 2 (o 



+cf-y 1 r i5(t)i^ M n(^t) 2 w (r - L)?,+2 (i)^ 

< cf— y 1 /°° i,s(t)rKMn(e,t) 2 w (r - L)p+2 w^, 



by (8.32). We now choose r = L + 1, giving 

p-i 



/ 77 \ P f°° 

s^ra < c — / i wrw [w(o^(t)^Mn(e, of (s.ss) 

\ a n / J —oo 



which we can reformulate as 

Lemma 8.4 Let < p < 2, n > 1, S 1 of degree < n, and |£| < 2a„. Then 



15 w| (0 - c UJ w^^w* • (8 - 39) 

From this follows: 

Lemma 8.5 Let V : [0, oo) — > [0, oo) he a convex increasing function of x, with ^(0) = 0. Let 
p > 0, n > 1, S of degree < n, and |£| < 2a ra . Then 

S-cc <l'( c i sw ')('«, t) 2 w 2 W* 
* |s ™ £ J±1WU)TO • (8 - 40> 

If in addition, for some A > 0, 

V?(2t) < A^(t), t G [0,oo), (8.41) 

then 

IZ> *( i: sw ")(t)KM„((, tfw\t)dt 
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Proof. The proof uses Jensen's inequality as in Lemma 8.3. □ 

The inequalities (8.40) and (8.42) are useful for more than Markov-Bernstein inequalities. But 
for the moment, we deduce by integration and restricted range inequalities [108J: 

Theorem 8.6 Let W S J-* . Let ip : [0, oo) — > [0, oo) be a convex increasing function of x, with 
V'(O) = 0. Let p > 0, n > 1, S of degree < n. Then 

^{\S'W\ p ){i)di < / iP{C— \SW\ p )(t)dt. (8.43) 

) J — oo a n 

If in addition, for some A > 0, (8.41) holds, then 

POO POO 

/ *l>(\S'W\ p )(£)d£ <C ^{—\SW\ p ){t)dt. (8.44) 

J —oo J —co ®n 

9 Nikolskii Inequalities 

We already proved an inequality of this type in the last section: 

/ n \ 1/p 

W p w\\ Loom < c [-) \\PW\\ Lpm , 

for n > 1, and polynomials P of degree < n — recall (8.24) and (8.35). Thus we compared the 
weighted sup norm and weighted L p norm of a polynomial. More generally, inequalities that com- 
pare the norms of polynomials of degree < n in different spaces are called Nikolskii inequalities. 
They are not difficult to prove, here is a sample: 

Theorem 9.1 Let W G J-* ■ Let < p,r < oo. Then for n > 1 and polynomials P of degree < n, 

\\PW\\ LAM) <CN n (p,r)\\PW\\ Lr{M) , 

where 



( i_i 

p r 



ail , r > p 

r < p 



i_i 

n \ r P 



Proof. If first oo > r > p, we can use restricted-range inequalities, and then Holder's inequality: 
W p W\\ P Lpm <C \PW\ P <C[ \PW\ p A / 1 <C\\PW\\ p Lr{w) a n '\ 

J —a n \J —a n / \J — a n / 

that is 

\\PW\\ Lpm < Cat" \\PW\\ Lr{m . 
If p = oo, the proof is easier. Next, if r < p < oo, we use 

/oo 
\PW\ r . 
-oo 
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Letting 



we obtain 



A 



\PW\ 



n tP .= SUp 

deg(P)<n 



\PW\ 



\PW\\ Lpm <Al P *\\PW\\ p Lr 



(9.1) 



In the case of Freud weights W £ T* that grow at least as fast as \x\ a , some a > 1, we know from 
(8.24) that 



For the canonical weights W a , we have 



n 



1-1/ a 



□ 



a > 1 



log(n + 1), a = 1 
1, a < 1 



The sharpness of these was prov6(l by Nevai and. Totik |143j , |152j. Observe that A n) p grows 
independently of p. In general, it seems that 



A niP ~ SU p\-\W 2 ,x)W'(x). 



For general convex exponential weights, it is known \85\ p. 295] that: 

Theorem 9.2 Let W £ T even (C 2 ) be as in Definition 7.5. Let < p, r < oo. Then for n > 1 and 
polynomials P of degree < n, 



\PW\\r m <CN n (p,r)\\PW\\r 



(R) 



where 



N n (p,r) 



r > p 



r < p 



10 Orthogonal Expansions 

There is an old mathematical saying that L%, L2 and L^ were invented by the Almighty, and man 
invented all else. The author heard this many years ago, but was interested to see it used as the 
opening quote in a chapter of Simon's treatise |172j . That orthogonal expansions naturally live in 
L2 is obvious. Among the many manifestations of this, is the best approximation property 

II (/ " [/]) W\\ Lm = E n [/; W] 2 = dcg inf <m || (/ - P)^|| L2(R) . 

This ensures that when the polynomials are dense (in an obvious sense) 

lim ||(/-Sn[/])^||w R )=0 
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for all functions / for which fW E L 2 { 

But it is part of the mathematician's spirit to take important tools out of their natural domain, 
so it is not surprising that much effort has been devoted to convergence of {S n [/]} in L p , or in a 
uniform sense, or at a specific point, and so on. A lot of fundamental advances have ensued: for 
example, the boundedness of the Hilbert transform in L p , 1 < p < oo, was established in order 
to prove that classic Fourier series converge in such L p . The theory of A p weights started with 
Muckenhoupt's efforts to prove convergence in L p of Hermite expansions. 

In this section, we shall discuss pointwise and mean convergence. We begin with the latter. 

10.1 Mean Convergence 

Recall the reproducing kernel and the Christoffel-Darboux formula: 

Ty / ,\ , x 7„_ip n (x)j> n _i0) ~ Pn-l(x)Pn(t) 
K n (x,t) = 2^Pj{x)Pj(t) = — — — I • 

j=o 7n x 1 

Define the Hilbert transform 

H [g] (x) = r A dt = lim f A dti 

J —oo x " e^0+ J oo,oo)\(a;— e,x+e) % t 

whenever the limit exists. If g £ Li(R), the transform exists for a.e. x. For a function / with 
fW S L 2 (R), we see that 

/•oo 

S n [f](x) = / K n (x,t)f(t)W 2 (t)dt 



7n-l 



In 

Thus if u is a given function, and 1 < p < oo, 



{ Pn {x)H [p n -lfW 2 ] {x)-p n . l { X )H [p n fW 2 ] (X)} . 



S n [f]Wu 2 \\ < ^h Pn Wu\\ Loom \\H[p n ^fW 2 ' 



In 



u \\l 



+ \\pn-lWu\\ Losm \\H [ Pn fW 2 ] u\\ Lp{R) } . (10.1) 



Next, we use the aforementioned theorem of Riesz that the Hilbert transform is a bounded operator 
on L p , provided 1 < p < oo: For some C ^ C(g), 

W H 19]\\l p (U) - C \\g\\L p {W) ■ 

In his investigations of Hermite expansions, Muckenhoupt |144| considered weighted versions: for 
suitable functions u, 

\\ H [9]u\\ Lp (M) < C\\gu\\ LpW . 
Such u are severely restricted, in particular satisfying Muckenhoupt's condition; examples are 

u(t) = (l + \t\) a , -~<a<l-~. (10.2) 

p p 
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The theorem fails if p = 1 or p = oo, though in L\, other inequalities are available. So we continue 
(10.1) as 



\S n [f] Wu 2 



< C- 



7i 



- { \\Pn 



Wu\ 



\p n -lfW 2 u\ 



+ \\Pn-lWu\\ Loom \\pnfW 2 



(10.3) 



For exponential weights in the real line, recall from (4.14) that 

7n-i 



7n 



< Ca n . 



If we have the bounds 
then for some C ^ C (n, /) , 



\Sn[f]Wu 2 \\ Lp[m <C\\fW\\ Lv{ 



(10.4) 



Once we have such a bound, the reproducing property of S n (that S* n [P] = P when P has degree 
< n) and density of polynomials gives 

n lim \\(f-S n (f))Wu 2 \\ Lp(R) = 0. 



All that is required of / is that fW 6 L p ( 

The problem with this procedure is that the bound (10.4) is hardly ever true. For Freud weights 
like W a , or the weights in the class J 7 *, we typically have 



\Pn(x)\W(x)<Ca- 1 ' 2 



a n 



+ n 



-2/3 



-1/4 



x 6 



(10.5) 



and the upper bound reflects the real growth of p n near ±a„: 

\\PnW\\ Loo{m ~ a-^V/e. 
However, at least in [— pa n ,pa n ], with p G (0, 1) fixed, we do have 

\p n W\ < Ca~ l l 2 . 



10.6) 



This problematic growth of the orthogonal polynomials near the (effective) endpoints of the interval 
of orthogonality, already occurs for the Legendre weight on [—1,1], and more generally, Jacobi 
weights. The same Pollard that solved Bernstein's approximation problem, came up with a fix for 
this. In the context of Freud weights (a similar feature occurs for Jacobi weights), the fix is based 
on the observation that p n — p n -2 has a much better bound than p n : 



\ Pn (x)-Pn-2(x)\W(x) < Ca^ 2 



1 



\ 1/4 

+ n~ 2/3 j x G 



(10.7) 



In particular, 



\p n { X ) - p n -2(x)\ W (x) < Ca^ 2 , X £ R. 
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(Think of p n (cos 9) = cosnO to see from whence this comes.) 

Pollard [158J, |159j . |160j found a clever way to rewrite the Christoffel-Darboux formula to 
exploit this: let us set 

7n-l 
a n = . 

In 

Then (see [90] or |141j for an accessible proof) 

K n (x,y) = K n>1 (x,y) + K n>2 (x,y) +K n>3 (x,y), 

where 

K n ,i(x,y) = p n -x(x)p n -i(y); 



K n , 2 (x,y) = """" 1 p n -i(y) 
a n + a n -i 

K n ,z(x,y) = K n>2 (y,x). 



p n (x) -p n - 2 (x) 



x-y 



Note that there is no x — y in the denominator in K n ^, while in K n>2 and K n ^, we have the term 
Pn — Pn-2 to help. This clever decomposition is not enough on its own; we emphasize that in 
investigating mean convergence, one still has to break up integrals into several different pieces, and 
work over several different ranges. 

For the Hermite weight, the simplest bound is due to Askey and Wainger [2]: 

Theorem 10.1 Let W{x) = exp(— x 2 ). Let | < p < 4. There exists C / C (n, /) such that 

\\Sn[f]W\\ Lpm <C\\fW\\ Lpm . 

This inequality is not true for p < | or p > 4. 

Muckenhoupt found the correct extension to 1 < p < oo. In what follows we let u a (x) = 
(1 + Mr- 
Theorem 10.2 Let W(x) = exp(— x 2 ) and 1 < p < oo. Let 

b<l--; B > —-; b<B. (10.8) 
p p 

Assume in addition that 

f 11 4 4 

-B + maxib, — } + 1<0 if p < -; (10.9) 

{ p) 3p 3 

and 

6-min 1^,1 - -1 + - - — < ifp>4. (10.10) 
I p) 3 3p 

Then 



\\S n [/] Wu b \\ Lp[m < C \\fWu B \\ Lp[K) (10.11) 

for some C ^ C (n, /). If b = B and p = | or 4, then we insert a factor of log (|x| + 2) in the 
right-hand side of (10.11 ). In the case of equality in (10.9) or (10.10), we need strict inequality and 
replace the max or min by their second terms. All these inequalities are also necessary. 
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For general Freud weights, Shing Wu Jha and the author |52j extended Muckenhoupt's result, 
also closing up slight gaps between the latter's necessary and sufficient conditions in "boundary" 
cases: we let 

La j n ) = l (log(n + l)) H , i£a = T 
a ' T \ 1, otherwise. 

Theorem 10.3 Let W = exp(-Q) eF. Let 1 < p < oo, and b, B £ R. 
(a) Then for 

\\Sn[f] Wu b \\ Lp{R) <C\\fWu B \\ Lpm 
to hold with some C ^ C (n, /), it is necessary that: 

(I) (10.8) holds. 

4 
3 

max(fe.-i)-B if 4 -,\ 

a n 1 p$ nHp~ 3 J =0 



(II) If p < | ; then 



1 



(III) If p = I or p = 4, then strict inequality holds in the third inequality in (10.8). 

(IV) If>>4, then 



(b) Assume in addition that the orthonormal polynomials for W 2 also satisfy (10.7). Then the 
conditions above are also sufficient. 



When W = W a , and a n = Cn 1 ^, the conditions above reduce to essentially Muckenhoupt's: 



11 4\ f < 0, 5 ^ 1 - i 



6\ p y I < 0, B = 1 

The bound (10.7) was established for exp(— x 2m ), m = 1,2,3,... in [52]. For general a > 1, 
it follows from results in that paper and work of Kriecherbauer and McLaughlin [68]. It is an 
interesting unsolved problem to establish the bound (10.7) for W £ J-*, or to come up with some 
method to prove it that works without much deeper tools. See [76] for some work in this direction. 
There is also an L\ analogue of these [52l p. 337]. 

Further work on mean convergence of orthogonal expansions appears in [56] , [77j . [123] . [130] . 
[131] . Discrete analogues of orthogonal expansions have been considered by Mashele [112] , [113] . 
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10.2 Uniform and Pointwise Convergence 

One of the simplest and yet most elegant and general theorems on pointwise convergence is an 
extension of the Dirichlet-Jordan criterion, due to Freud [39]. It was subsequently extended by 
Mhaskar [129j . [131J. Its proof is quite simple, and follows readily from estimates of previous 
sections, so we provide it: 

Theorem 10.4 (Dirichlet Jordan Criterion) Let W G T* and let f : E — > R be absolutely 
continuous, with f'W G LiQR). Then 

lim \\(f-S n [f])W\\ Loo{R) =0. 



Proof. Note first that fW is bounded and has limit at ±oo. To see this let A > 0, and x > A. 
Then 



\fW\(x) = 
as W is decreasing. Hence 



f(A) + f r 

J A 



W(x) < \f(A)\W(x)+ / |/'| W, 

J A 



/•oo 

limsup | fW\ (x)< \f\ W. 



Our integrability condition on f'W ensures that this last right-hand side can be made as small as 
we please. So, indeed, f'W has limit at ±oo. 
Next, let m= [n/2\. Then 



IK/ - s n [/]) w\\ Loo(R) < ||(/ - v m [/]) w\\ Loom + \\(v m [/] - s n [/]) w\\ Lo 

Here by Nikolskii inequalities, such as in Theorem 9.1, 



(10.12) 



n 



\\(v m [/] - s n [/]) w\\ Loo(m < c x - \\{V m [/] - S n [/]) w\\ L2{ 



< C \f~{\\( V m [/] " /) W\\ Lm + ||(/ - S n [/]) W|| L2(R) } 

< 2cJf\\(V m [/]-/) W1 L2(R) . 

V a n 



(10.13) 



Recall the L2 best approximation property of the partial sums S n , and that V m has degree < 2m < 
n. Next, 

\\{v m [f]-f)w\\l m < \\(V m [/] - /) w\\ Li(R) \\(V m [f] - f) w\\ L , R] 



< CE^f-W^WiVmif)- f)W\\ Loom 

< c a ^\\fw\\ Lm \\{v m [f]-f)w\\ Lo 
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We used the Jackson-Favard inequality (4.5) in the last line. Since m ~ n, so 
substituting this into (10.13), we obtain 



By 



\\(v m [f]-s n [f])w\\ Locm < c 

so (10.12) becomes 

\\(f - S n [f])W\\r m < \\(f-V m [f])W\\ L 



\f'w\ 



\\(V m [f]-f)W\ 



1/2 



+ c 



< C 1 [E m [f;W} 00 + E m [f;W] 



\f'W\ 



1/2 



\\(Vm[f]-f)W\ 



1/2 



by Theorem 4.8 again. Since fW is continuous, and has limit at infinity, we have E m [/; W]^ — > 
as n — > oo. □ 

This theorem is attractive because of the simplicity of the hypotheses. We don't need to assume 
anything about the "tail" of /. There are also results which give convergence at a point, under 
local assumptions. A result of Mhaskar [136, p. 224] is the archetype: 

Theorem 10.5 Let W € T* and in addition assume that Q" is increasing on (0, oo). Let f be of 
bounded variation, and x be a point of continuity of f. Then 



\Sn[f] (x)-f(x)\<C(x) 



WW\ + \Hr w\df\ + 



\t\>Ca n 



W\df\ 



Here C(x) depends on x, but not on n or f. 



Mashele [ 1T4~] has established analogues of this for general exponential weights. Unfortunately 
in passing to the more general weights, some of the power of this result is lost. Ky [77] has 
investigated a.e. convergence of orthonormal expansions for Freud weights. Trilinear kernels and 
related expansions have been explored by Osilenker [154J. 

There is an extensive literature on expansions in Hermite polynomials, including transplantation 
theorems [186j . [187] that specifically exploit the structure of the Hermite weight. There are also 
many results for Laguerre and related expansions, on the interval [0, oo), see [119 . 



11 Lagrange Interpolation 



Lagrange interpolation at zeros of orthogonal polynomials is probably the most studied of all 
approximation processes associated with exponential weights. Recall that we denoted the zeros of 
p n , the nth orthonormal polynomial for the weight W 2 , by 

-OO < X nn < £„_!,„ < ■ ■ ■ < X 2n < Xi n < OO. 

Given a function /, we define the nth Lagrange interpolation polynomial to / at {xj n }™ =1 to be 
the unique polynomial of degree < n — 1 that satisfies 



Ln [/] (Xjn) = f (Xjn) , 1 < J < n. 
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One formula for L n [/] is 



3=1 



where {^jn}" =1 are fundamental polynomials of Lagrange interpolation, satisfying 
The {lj n } admit the representation 



ijnw= n 



71 



fc=l,fc# Xjn Xkn 



but because we are using zeros of orthogonal polynomials, there are others: 

^jnfa) 



K n (x, Xjn) 



K n {% jni Xjn) 

(To see this, look at the Christoffel-Darboux formula.) 

Another advantage of interpolating at zeros of orthogonal polynomials is the ease of analysis, 
largely because we have the Gauss quadrature formula. For polynomials P of degree < 2n — 1, 



/oo n 
PW 2 =J2 X inP(xjn), 
3=1 



where 

Xjn — A n (W^ , Xjn) — 1 / Knixjni Xjn) 

are called the Christoffel numbers. Since L n [f] 2 has degree < 2n — 2, and agrees with / at 
{xj n }, we see that 

/oo n 
L n [f} 2 W 2 = J2^nf(xjn). (11.1) 

3=1 

Thus boundedness and convergence of {L n [/]} is closely associated with convergence of Gauss 
quadrature. There are at least two recent monographs devoted to Lagrange interpolation [116] . 



11.1 Mean Convergence 

Just as {S n [/]} converge in L2, so do the {L n [/]}, a famous theorem of Erdos-Turan. Indeed, we 
may think of L n [/] MS cl discretization of S n [/] : 

n roo 
Ln [/] (X) =J2 X 3nf(xjn)K n (x,Xjn) « / W 2 (t) f (t) K n ( X , t)dt = S n [f] {x). 
3=1 J -°° 

At this stage, may I share two Erdos stories with the reader. I was doing a postdoc at the 
Technion in Haifa in 1982. Erdos often visited the Technion, and I was one of several who went to 
lunch with him. Later that day, I carried his suit to the drycleaners, and the reward was some time 
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with him. He asked me what I was doing, and I mentioned numerical quadrature. He replied that 
I should go and read the book of Freud on orthogonal polynomials, which I did, and it changed my 
career. 

Here is another story, which illustrates how little we communicate across fields. In early 2006, 
I had the privilege of visiting Texas A&M in College Station, to give some Frontiers Lectures. I 
mentioned the famous theorem of Erdos-Turan on distribution of zeros of polynomials. At the 
end of the talk, one of the very first comments was "I thought Erdds was a graph theorist". 
Hmmph. He spent a large amount of his life on Lagrange interpolation, orthogonal polynomials 
and approximation theory! 

Since Lagrange interpolation samples a function only at a discrete sets of points, and sets 
of measure zero make no difference to Lebesgue integrals, it is clear that we shall need to use 
something other than Lebesgue integrability. As Gauss quadrature sums are Riemann sums or 
Riemann-Stieltjes sums |18H p. 50], the Riemann integral is the appropriate tool. The following 
result was actually proved by Shohat |171j : Erdos-Turan considered only finite intervals. The 
growth of / at oo poses extra problems for the infinite interval. 

Theorem 11.1 (Shohat's version of the Erdos-Turan Theorem) Let f be bounded and 
Riemann integrable on each finite interval. Assume, moreover, that there is an even entire function 
G with all non-negative Maclaurin series coefficients such that GW 2 is integrable over the real line, 
and 

lim f 2 (x)/G(x) = 0. (11.2) 

|a!|— >oo 

Then 

/oo 
(f-L n [f]) 2 W 2 = 0. (11.3) 
-oo 

Proof. Let P be a fixed polynomial. For large enough rt, L n [P] = P, so 

/OO f'OO 
(f-L n [f]) 2 W 2 = / (f-P-L n [f-P]) 2 W 2 
-oo J — oo 

/OO f'OO 
( / _ P) 2 T y2 + 2 / {Ln[f _ P] f W 2 
-oo J — oo 

/oo n 
(/ " PfW 2 + M/ " Pf&in). (H.4) 
j=l 

In the second last line, we used the inequality 

(x + y) 2 < 2 (x 2 + y 2 ) 

and in the last line we used (11.1). Now if G is not a polynomial (which we may assume, just add 
a very slowly growing entire function if it is), 

lim (/ - P) 2 {x)/G{x) =0. 

|a;|— *oo 

Let e > 0. Choose A > such that 

\x\ > A => (/ - P) 2 {x)/G{x) < £. 
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Then 



n poo 

A in (/ - Pf{x jn ) <e X ^ G (^™) ^ £ Yl X i nG ^ £ / G ^ 2 ' 

j:|x jn |>A J':|ajn|>j4 j=l 

by a special case of the Posse-Markov-Stieltjes inequalities [381 p. 92]. These inequalities estimate 
quadrature sums of absolutely monotone functions in terms of integrals. Next, since (/ — P) 2 X[-A,A] 
is Riemann integrable, while Gauss quadrature sums are Riemann sums, 



lim V Xjn(f ~ P)\x jn ) = I (/ - P) 
• , ^ a J -A 



2 W 2 . 



j:\xj n \<A 

Substituting all these in (11.4) gives 

/OO /'OO /'OO 

(/ -L n [f]) 2 W 2 <4/ (f-P) 2 W 2 + e GW 2 . 
-oo J — OO J — OO 

Here P is independent of e, and by L2 analogues of the Bernstein problem, may be chosen so that 
f^xiif ~ P) 2 W 2 is as small as we please. □ 

Note that if / is of polynomial growth at 00, we can always choose G to satisfy (11.2). For 
large classes of exponential weights, the author [87], [88] constructed entire functions G satisfying 
the hypotheses above, and with 

G{x)W 2 {x) ~ (l + |x|T a , 

any a > 1. So typically, the growth restriction on / is hardly more than that required for (fW) 2 
to be integrable. 

Of course this is a really beautiful result, and its proof is elegant! The extensions to L p require 
a whole lot more work. But like their cousins in orthogonal expansions, a lot of great mathematics 
has come out of these efforts. It was Paul Nevai [149] who obtained the first major result in L p , 
for the Hermite weight. 

Theorem 11.2 Let f be continuous on the real line. Let W(x) = exp (— ^x 2 ) be the Hermite 
weight, and assume that 

lim f(x)(l + \x\)W(x) = 0. 

|ie|^oo 

Let {L n [/]} denote the Lagrange interpolation polynomials to f at the zeros of Hermite polyno- 
mials. Then for p > 1, 

Km \\(f-L n [f])W\\ Lpm =0. 

Note that the pth power of the weight W appears in the integral. Nevai showed that the presence 
of such a factor is necessary — see his still relevant survey for more perspectives on this |147j . The 
author and D. Matjila [106] subsequently extended Nevai's result to general Freud weights: 

Theorem 11.3 Let f be continuous on the real line. Let (3 > 1, p > 1, and W{x) = exp ^— | \x\^ . 
Let A 6 1, a > 0, and & := min{l,a}. Moreover, let 

1 ( 41 

T :=T(p) :=-- a + -max < 0,1 --} . (11.5) 
P 6 [ p) 
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Let {L n [/]} denote the Lagrange interpolation polynomials to f at the zeros of {p n (W 2 , x)} . For 



lim 

n— >oo 



(f-L n [f})(x)W(x)(l + \x\Y 
to hold for every continuous function satisfying 



lim f{x)(l + \x\) a W(x) = 



(11.6) 



(11.7) 



it is necessary and sufficient that 

A > r if 1 < p < 4; 

A > r if p > 4 and a = 1; 

A > r if p > 4 and a / 1. 

Moreover, for (11.6) to hoid for every 1 < p < oo and every continuous f satisfying (11.7), it is 
necessary and sufficient that A > — & + max |l, ^ |. 

This in turn, is a special case of a result for general Freud weights: 

Theorem 11.4 Let W = exp(-Q) G .F*. Let p > 1, A £ 1, a > 0, and & := min{l,o}. Let 
{L n [/]} denote the Lagrange interpolation polynomials to f at the zeros of {p n (W 2 ,x)}. For 
(11.6) to hoid for every continuous function satisfying (11.7), it is necessary and sufficient that 



A>-d + ± 
i-(a+A) Ul-±) _/ ! 



log n 



an n 6 V p 



O(l) 



if 1 < p < 4; 



if p > 4 and a = 1; 



if p > 4 and a/1. 



(11.8 



The main ideas of proof of these results already appear in Nevai's 1980 paper, although, obvi- 
ously, technical details are more complicated for general weights. In analyzing L n [/], one fixes a 
polynomial P and splits the function f — P into two pieces, for example, 



Ln[f~P] = L n [(/-P)x Hfl „, K] 



+ L n 



(/ - P)Xm\[- ia„ia„] 



and handles them separately, using different techniques. One of the reasons for the split is the 
bound 

\p n (x)\W{x)<Ca- 1 ' 2 , \x\<(l-e)a n , 
any fixed e > 0, which does not extend all the way to ±a n ; recall the discussion at (10.5) and (10.6). 

( 1^.1 \ —1/4 

Indeed, it is the fact that p n (x)W(x) behaves roughly like (l — j^J near ±a n that accounts 
for the change in phenomena at p = 4, as well as for the complicated conditions. 

The weighted L p norms of the two "pieces" of / are themselves split into a number of pieces, with 
different techniques used to estimate over different ranges. Tools include estimates of quadrature 
sums in terms of integrals (see Section 12), bounds on Hilbert transforms, and estimates of special 
quadrature sums. Here we shall establish just one of the main estimates, using ideas that go back 
to Richard Askey pQ. 
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Lemma 11.5 Let 

and 

Then for n > 1, 



^ = (/- p )*[-K,K] 

A=||(/-P)(x)W(x)(l + |x|) a || Lao(R) . 



L n [i, n ] (x)W(x)(l + \x\r A 
where C is independent of f,n, P. 

Proof. We let 



[ 2 G ' n 1 2 a ' n J 



(11.9) 
(11.10) 
(11.11) 



and first show that 

Lntyn] (x)W(x)(l + \x\y A 



< CA sup 



G(x) =W(x) -1 (l + |x|) -a 
S n [hG] (x)W(x)(l + \x\)- A 

<i 



Then we show that 



sup 



<1 



S n [hG](x)W(x)(l + \x\y A <c, 



(11.12) 



(11.13) 



giving the result. 

Proof of (11.12). We use duality, but in a very precise form. Let 



5n (x) := sign{L n [4> n ] (x)} \L n [4> n ] (x)r' W*~\x) (1 + |x|)"^ X[ _ i^.i^ 



x . 



Then 



L n [V„] (x)W(x)(l + |x|)" 



-^p [ 2 Ctn 5 2 



in [tpn]9n(x)W 2 (x)dx 

L n ty n ]S n [g n ](x)W 2 (x)dx, (11.14) 



by orthogonality oi g — S n [g] to polynomials of degree < n — 1 . Using the Gauss quadrature formula, 
and the interpolatory properties of ip n , we continue this as 

n 

= ^2^kn1pn(x kn )S n [g](x kn ) < A ^ X kn \S n \g] {Xkn^W" 1 {x kn ) {I + \x kn \y a . 
k=l \xkn\<\o.n 

We estimate the quadrature sum by an integral. These types of inequalities will be discussed in 
Section 12. For the moment, we note that \ kn behaves roughly like ^W 2 (x kn ) and continue this 

as 

poo /»oo 
-co 



/CO poo 
\Sn[9n] (x)\W(x){l + \x\Y a dx = CA S n [g n ](x)h n (x)G(x)W 2 (x)da 
-oo J — oo 

/oo 
g n (x)S n [h n G] (x)W 2 (x)dx, 
-co 
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where h n (x) := sign (S n [g n ] (x)), and we have used the self-adjointness of S n . Since g n vanishes 
outside [— ^a n , ^a n ] , and letting q = we continue this as 



CA / g n {x)S n [h n G] (x)W 2 (x)dx 



< CA 



CA 



g n {x)W(x) (1 + \x\) 

L n [ipn] (X)^(X)(1 + |X|)~ A 



S n [h n G] (x)W(x)(l + \x\y A 

L/p 

S n [h n G] {x)W{x)(l + \x\)- A 



by the form of g n . Cancelling the {p — l)th power of 

L n m (X)^(X)(1 + |X|)- A 

in the extreme left of (11.14), we obtain (11.12). 



Proof of (11.13). We use the ChristoffeKDarboux formula much as we did in the section on 
mean convergence of orthogonal expansions: 

S n [hG] (x) = ^ { Pn (x)H [p n -ihGW 2 ] (x)-p n ^(x)H [p n hGW 2 ] (x)} . 



Recalling the bound (10.6) on p n and the bound 



7n~l 
In 



< Ca n , 



we obtain 



We next let 



and split 



\S n [hG] W\(x)< Call 2 Yl \ H [Pj hGw2 ] ( 3 

j=n-l 



X n X[-§a n ,fa n ] 



\S n [hG] W\(x)< Ca n /2 \ H [Pj h Gx* n W 2 



j=n-l 



+Ca 1 J 2 Y, \H[ Pj hG(l-x n )W 2 }(x)\ 

j=n-l 



(11.15) 



We next use weighted bounds for Hilbert transforms, of the type established by Muckenhoupt |144j . 
Let 

A = min { A, s\ , 

P J 
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for some small 5 > 0. Then by Muckenhoupt's bounds for weighted Hilbert transforms, 

n 

«n /2 E \\H[p 3 hG X * n W 2 ](x)(l + \x\r A 



j=n-l 



< Call 2 £ ||(p^G X ;W 2 )(x)(l + |x|)- 
3=n-l 

-a-A 



< C 



(/iGW)(x)(l + |x| 



< c 



1+ x 



<c, 



(11.16) 



as (a + A) p > 1. (Recall our hypotheses in Theorem 11.4 — they imply A + a > -.) Next, for 



M < ^n, and j = n — 1, n, 

l^[ P ^G(i- x ;)^ 2 ] (x)| 



(p^GW 2 ) (t) 



{t:|i|>|a„} 



■ C 



( Pj wy 



x — t 

n l/2 



< C 



{*:|*l>|on} 
1/2 



| Pi W| (t)t 



-l-a 



dt 



So 



4/ 2 £ |^[MG(l- X ;)^ 2 ](x)(l + |x|)- A 

i=n-l 

r i, a p > i, 



<Ca"« 



(1 + 1*1 



J P[ 2 



<Ca~ a 



since 



(logn)p, Ap=l, =o(l) 
an , Ap < 1. 

1 



(11.17) 



-a + 



A < 0. 



P 



Combining (11.15), (11.16), and (11.17), yields (11.13) and hence the result. □ 

Of course, this is just part of the estimation, but is the most difficult part. For the full details, 
see [TUB] . 

Mean convergence of Lagrange interpolation associated with Erdos weights has been investigated 
by S. Damelin and the author [25] . [26]. The author has investigated Lagrange interpolation 
associated with exponential weights on [—1,1] [M], while D. Kubayi and the author investigated 
mean convergence associated with general exponential weights [75]. The convergence in L p , p < 1, 
has been investigated by Matjila [125] . 

One obvious question is whether other interpolation sets lead to cleaner, or simpler, results. It 
was J. Szabados [176j . |192j who came up with this idea, the "method of additional points", in the 
context of Lebesgue functions, which we shall discuss shortly. Szabados' idea was to add two extra 

( 1^,1 N —1/4 

points of interpolation, near ±a n , to damp the growth of p n (x) (recall, roughly ( 1 — jp j ) near 
±a n . Suitable points are those where p n W attains its sup norm on the real line: 



\PnW\{t n ) = \\ Pn W\ 
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One can show that 
and that 



\l-tn/a n \ <Cn- 2 / 3 



\x\ n ~ Cn| /a n ~ n 2/3 . 

We let L* n [f] denote the polynomial of degree < n + 1 that interpolates to / at the n + 2 points 

{ Cm X nrll %n—l,m ■ • • j ^-2n; X\ ni £n} • 

Inspired by Szabados' work on Lebesgue functions, G. Mastroianni and the author [103] proved: 
Theorem 11.6 Let W G T* . Let p > 1, A G R, a > 0, and d := min{l,a}. For 



lim 

n— >oo 



(/-l; [/])(x)^(x)(i + |x|) A 

to hold for every continuous function satisfying 







lim f(x){l + \x\) a W(x) = 

|a;|— »OC 



(11.18) 



(11.19) 



it is necessary and sufficient that 



A > d. 

P 



What a difference the extra two points make! Let us explain a little of the mechanics behind 
this. L* [/] admits the representation 

n 

K If] (X) = f (O t 0n (x) + f (-&,) C+l,n(z) +£/ (X in ) t hn {x), 

3=1 



where for 1 < j < n, 



and 



£jn{x) 



e 



2 ' 



(^jra) (^ Xj n ) 

The factor £ 2 — x 2 damps the growth of p n (x) near ±a n . Of course, there is an extra factor of 
£n ~~ x 2 n in the denominator, but this is large for most Xj n , so actually helps. 

We already noted that convergence of {L n [/]} associated with Freud weights in L p , p < 1, has 
been investigated by Matjila [125] . If one sacrifices precision in this case, one can achieve very 
general results, using distribution functions, rearrangements of functions, and a classic lemma of 
Loomis [3]. The latter asserts that for any {cj}" =1 , 







n 




8 n 


meas < 


.: 


^^^^^ 00 *Xj ' 

j= i o 
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(It is often stated in the case where all Cj > and without absolute values.) If one writes 
L n [f] (x) = p n (x) , ( ^TT^Z V =: Pn{x)g n {x), 

j =1 Pn \ x jn) {X Xjn) 

we see that for any positive function cj) and real numbers 6, c, 

\\Ln [/] W<f> b \\ LpiR) < |K^/ +C ||L 2p (R)ll5nr C ||L 2p (M)- (11-20) 

Next, Loomis' Lemma shows that 

m 9 „(A):=meas{*:| 5n (*)|>A} < g ^7^) 

8||/Wlk 



A 



For the moment, let h* denote the decreasing rearrangement of a function h. In particular, the 
decreasing rearrangement of g n is 

5 n (i) = sup {A : m 9n (A) > tj < sup i A : U n > t V = S2 n . 

Inequalities for decreasing rearrangements give [3] 

imiiju) = fM-f< n^v 2p r= r ^-ry pc 

J — OO J 

< (8||W=|| Loo(R) n n ) ^ ^((^TO) PC dt. (11.21) 

Now if p < i, this integrand poses no problem at 0. If for example, 

</>(t) = l + \t\, 

then 



)*(*) = (1 + g ] , t>o, 

and the integral will converge provided 2p + 2pc > 1. Combining this, (11.20) and (11.21), we 
obtain 

\\Ln[f] Wct> h \\ Lp{w) < C||/^ C || ioo(R) O n || Pn ^ +C || i2p(M) 

with C independent of n, f. Recalling the definition of Vt n above, we have the main part of |97} 
Theorem 3, p. 155]: 

Theorem 11.7 Let W he even and positive in [0, oo) and 4>(t) = 1 + \t\. The following are 
equivalent: 

(I) There exist b, c € R and C,p > such that for every continuous f 

\\Ln[f] W<t> b \\r m < C\\fW<t> C \\ Loa(R) . 
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(II) There exist j3, 7 S M and r > such that 

From this one may easily obtain necessary and sufficient conditions for mean convergence of 
{L n [f]} in L p , some p < ^, with appropriate weights. The striking feature of these results is the 
simplicity of the proofs, and their generality. However, one sacrifices a great deal of precision, for 
even the specific L p is not fixed in advance. For further work on mean convergence of Lagrange 
interpolation associated with exponential weights, see [19], [21], [23], [53], |117j . [118] . |123j . [153J, 
[168] . [169] . 

11.2 Lebesgue Functions and Pointwise Convergence 

Since L n [P] = P for any polynomial P of degree < n — 1, the convergence of {L n } in almost any 
norm is equivalent to the norms of these operators being bounded independent of n. The Lebesgue 
function is the weighted norm of L n [f] at a given point. When working on infinite intervals, it 
makes more sense to investigate weighted version of these. The idea is 

n 

\L n [f](x)\W(x) < 52\f(x jn )\\l jn (x)\W(x) 
j'=i 

n 

< WfW\\ LooW W(x) \tjn(x)\ W- 1 (x jn ) =: \\fW\\ Loo(R) A n (x). 
i=i 

It is easy to see by suitable choice of / that 

A n (x) = sup [\L n [/] (x)W(x)\ : \\fW\\ Loo(R) < l} , 

so the Lebesgue function A ra (x) is indeed a norm. The Lebesgue constant of L n is 

A n := ||A n || Loo(R) . 
Observe that for any function /, there is the pointwise error estimate 

W(x)\f-L n [f]\(x) < (1 + A„(x)) inf \\(f - P)W\\ Loo{R) 

deg(P)<n-l °° v ' 

= (l + A n (x))£ n [/;tt1oo 

and hence the global estimate 

II W{f - L n [f])\\ Loo(R) < (1 + A n ) E n [/; W}^ . 

Amongst the contributors to estimation of A n (x) on infinite intervals are Damelin, Freud, 
Horvath, Kubayi, Matjila, Nevai, Sklyarov, Szabados, Szabo, Szili and Vertesi. The earliest results 
were due to Freud and Sklyarov. 

The following result for general Freud weights is due to Szabados |176j . Indeed, it was in 
this context that Szabados introduced his "method of additional points". Matjila |124j . [126] 
independently obtained the upper bound implicit in (a). 
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Theorem 11.8 Let W = exp(-Q) G T* . 

(a) Let A n denote the Lebesgue constant for the Lagrange interpolation polynomials L n [■] at the 

zeros of p n (W 2 , •) . Then 

A n ~ n 1/6 , n > 1. 

(b) Let A n denote the Lebesgue constant for the Lagrange interpolation polynomials L n [■] at the 

zeros of p n (W 2 , •), together with the points ±£ n where \p n W\ attains its maximum. Then 

A* n ~ log(n + 1), n > 1. 

/ , iN-l/4 

The n ' arises from the factor ( 1 — „ ) that gives the growth of p n near ±a n , together 



with the fact that 

1 _ fllL „ n -2/3_ 
On 

It is remarkable that Szabados' addition of the two points ±£ n gives the optimal factor of logra. 
Yes, that is the same lower bound of logn for the sup norm of projection operators onto the space 
of polynomials of degree < n, in the context of finite intervals |179j . In the context of exponential 
weights, this optimality was proved by Szabados [176J. There are extensions of this result to Erdos 
weights, and exponential weights on (—1,1) [T2], [13], [71], [73], [74] . Typically there one obtains 
the same result for A* , while 

A n ~ (nT(a n )) 1/6 , n > 1. 
Some inkling of the proofs comes from the representation for the fundamental polynomials 

when we interpolate at the zeros of p n - Then 

n n 
A n { X ) = W(x)Y,\^n(x)\W- 1 (x Jn )=W(x)Y,^nW- 1 (x Jn )\Kn(x,X Jn )\ 
j=l j=l 

/oo 
W(t)\K n (x,t)\dt. 
-oo 

Recall that Xj n ~ 2 n L W 2 (xj n ) at least for Xj n well inside the interval [— a n , a n ]. 

For general sets of interpolation nodes, lower bounds for Lebesgue functions have been obtained 
by Vertesi and his coworkers [189], [190] . These show that most of the time the Lebesgue function 
is bounded below by logn [1901 p. 359]: 

Theorem 11.9 Let W S J-* . For n > 1, let A n (x) denote the Lebesgue function corresponding 
to n distinct points (not necessarily the zeros of p n ). Let e > 0. Then there is a set 7i n of linear 
Lebesgue measure < 2ea n , such that 

A n (x) > Celogn, x £ [-a n , a n ] \H n . 

The constant is independent of n,x, e, and the array of interpolation points. 
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Szabados analyzed the influence of the distribution of the interpolation points, and in particular 
the largest interpolation point, on the size of the fundamental polynomials and Lebesgue constant 
|177j . Kubayi [72] and Damelin [T7] explored Szabados' result in the context of more general weights. 
Summability of weighted Lagrange interpolation has been investigated by Szili and Vertesi [183J, 

HBU, EM5]. 

When one increases the degree of the polynomial interpolating at n points from n—1 to |n(l+£)J , 
some fixed e > 0, then one can avoid the factor of logn above. For finite intervals, this was an 
old result of Erdos. For Freud weights, it is due to Vertesi [191], and for exponential weights on 
[-1,1], due to Szili and Vertesi [182]. 

Theorem 11.10 Let W G T*. Let us be given an array of interpolation points {xjn} n> i 1<jXn 
with associated fundamental polynomials of interpolation {£jn} n> i k,-<„- Assume that these admit 
the bound 

SUp \ijn(x)\ W" 1 (Xj n ) W(x) < A, 
x<=R 

for n > 1 and 1 < j < n. Let e > and let f be a continuous function such that fW 1+e has limit 
at ±oo. Then there exist polynomials {P n } such that 

(I) P n has degree N n < n (l + e + Cen~ 2 / 3 ) ; 

(II) P n (x jn ) = f (Xjn), l<j<n; 

(III) \\W l +%f - P n )\\ L ^ m <CE Nn [fiW^)^. 

Note that the fundamental polynomials associated with the orthogonal polynomials p n (W 2 ,x) 
satisfy the above bound [83J . 

Despite the many investigations of Lagrange interpolation, there are relatively few results where 
local assumptions are imposed that guarantee pointwise convergence of {L n [/]}. It seems that the 
following interesting result of Nevai has never been extended beyond the case of the Hermite weight 

Theorem 11.11 Let W(x) = exp(— x 2 ). Let f be Riemann integrable in each finite interval, and 
assume that for some c < ^ , 

WfW c \\ Loo(m < co. 
Assume that on [a,b], f satishes the one-sided Dini condition 

f(x + t) - f(x) > -v(t) |logi| _1 , a<x<x + t<b, 

where v(t) decreases to as t decreases to 0. Then 

lim L n [/] (x) = f(x) (11.22) 

n—roo 

at each point of continuity of f in (a, b). The convergence is uniform in every closed subinterval of 
(a, b) where f is continuous. 

Note that any increasing function / satisfies this Dini condition, so (11.22) holds also when / 
is of bounded variation in [a, b]. (For then / is the difference of monotone increasing functions.) 
Finally, we note that convergence of Lagrange interpolation to entire functions of suitably restricted 
growth leads to a geometric rate of convergence, see [136J. 
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12 Marcinkiewicz— Zygmund Inequalities 

The classical Marcinkiewicz-Zygmund inequality has the form 



<c 2 



where n > 1, i? is a trigonometric polynomial of degree < n, and Ci and C2 are independent of 
n and R [1931 Vol. 2, p. 28]. These types of inequalities are useful, for example, in discretisation 
problems, quadrature theory, and Lagrange interpolation. 

In the context of exponential weights, these have chiefly been used for mean convergence of 
Lagrange interpolation. In fact, the proof of the Erdos-Turan theorem above was based on a 
converse quadrature sum, namely, 



f 

j — < 



Ln [f] 2 W 2 = J2 X Jnf 2 (x jn ). 



This of course is an immediate consequence of the Gauss quadrature formula. In our outline of the 
proof of Lemma 11.5 above we used a forward quadrature sum. Let us repeat this, in simplified 
form: assume that we have an inequality 



n roo 

Y, x mW-\x, n )\PW\(x jn )<C \PW\. (12.1) 

J— 00 



If Q = ^ri, duality gives 

/oo 
Ln [f]gW 2 , 
-00 

where the sup is taken over all g with HffWH^ ( R \ < 1. Using orthogonality of g — S n [g] to 
polynomials of degree < n — 1, and then the Gauss quadrature formula gives 

/oo n 
Ln [/] S n [g] W 2 = SUp^ \jnf(Xj n )S n [g] (Xj n ) 
-00 g . =1 

n 

< E ^nW~ 2 (x jn ) \S n [g] (x jn )W(x jn )\ 

i=i 

/oo 
\S n [g]\W. 
-00 



In the last line, we used the quadrature sum estimate (12.1). Setting a n := sign (S n [g]), and then 
using self-adjointness of S n , we continue the above as 



C\\fW\\ Loo(R) anW-'SnlgjW 2 = C\\fW\\ Loom S n [<y n W~ l ] gW 2 
J— 00 J— 00 

< CWfWW^WS^a^jwW^ 
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If S n is bounded in a suitable sense, we obtain 

\\Ln [f]W\\ Lpm <C\\fW\\ Locm 

with C independent of n. We emphasize that this is a simplified version of what one needs. Typically 
there are extra factors inside the quadrature sum. In subsequent sections, we shall present several 
methods to establish such quadrature sums. 

12.1 Nevai's Method for Forward Quadrature Sums 

We let u £ [xj n , sj_i jn ], p > 1, and start with the following consequence of the fundamental theorem 
of calculus: 

f'-^j — l,n 

\PW\ p (x jn ) <\PW\ p (u)+p ' \PW\ p - 1 (s)\(PWY(s)\ds. 

J u 

We may assume that u is the point in [xj n ,Xj-i tn ] where |-FW| P attains its minimum. We now 
need estimates for Christoffel functions, and spacing of zeros of orthogonal polynomials. In fact 
the former imply the latter via Markov-Stieltjes inequalities. We use 

Xj n W~ 2 (x jn ) = X n (W 2 ,x jn )W~ 2 (x jn ) ~ (p n (x jn ), 

where ip n is defined by (7.8), and 

Xj n Xj — \ n ^ C Xj n W (Xjn). 

Then 

X jn W- 2 (x jn )\PW\ p {x jn ) < C / \PW\ p (u)du 



J it 



j — l,n 

+c<p n (xjn) / ' \pw\ p -\s)\(pwy( s )\ds. 

Summing over j, and using the fact that ip n does not change much in [xj n , Xj_i in ], one obtains 

Y / ^nW- 2 ( Xjn )\PW\ P (x, n ) < C / \PW\ P {u)du 

i=i 00 

/oo 
\PW\ p - 1 {s)\{PW)'{s)\i Pn {s)ds. (12.2) 
-oo 

We apply Holder's inequality, and then the Bernstein inequality Theorem 7.6 to the second term: 

/oo 
\pw\ p -\ s )\ (pwy ( S )\vn{s)ds < \\pw\\ p L - l m \\(Pwy<p n \\ Lp{M) < c\\pw\\ p L - l m \\Pw\\ Lp(m . 
-oo 

Together with (12.2), this gives 

n 

^2 XjnW- 2 (x, n )\PW\ p (x jn ) < C \\PW\\ p Lpm . (12.3) 
i=i 

Here is a sample of what can be proved using this method [107} Thm. 2, p. 287]: 
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Theorem 12.1 Let W <E J 7 * , 1 < p < oo, -oo < b < 2 and a £ E. Then 

Y,^nW- b (x jn )(l + \x 3n \) a \PW\v(x jn )<C / \PW\v(t)(l + \t\) a W 2 - b (t)dt. 

j= l J -oo 

This method has also been used by H. Konig in a vector valued and Banach space setting, with 
W as the Hermite weight |65j . [66| . It was used first for Hermite weights by Nevai [149] . Of course, 
it can be used for points other than zeros of orthogonal polynomials — all we need are suitable 
estimates on the spacing between successive zeros. 



12.2 The Large Sieve Method 

As far as the author is aware, this also was developed by P. Nevai, based on the large sieve of 
number theory, and used by him and his coworkers for exponential weights |105j . We already 
presented the idea in the context of Markov-Bernstein inequalities in Section 8.6. We start with 
Lemma 8.3. For polynomials S of degree < n, 

, < jz\swnt)K Mn (t,t)W(t)dt 

lSWl {0 - C ^K^tyw^dt ' 

which is equivalent to 

poo 

\SW\p (0 < CA Mn (0 / \SW\P(t)K Mn (H,t) 2 W 2 (t)dt. 



Here p > 0, S is a polynomial of degree < n, and |£| < 2a n , while M is a fixed large enough positive 
integer, independent of n, P, £. We now simply take linear combinations of this, and use the bound 



AMn(^ 2 ,0<C-^ 2 (0, lCl<2a r , 
n 



For example, if we use the zeros of p n , we obtain 



n roc 

\SW\P(x jn ) < C / \SW\P(t)E n (t)dt, 

7 = 1 



where 

a 



S„(t) = — W 2 (t) K 2 Mn (x jn , t) W 2 (x p 



n 

3=1 



Here, by the Christoffel-Darboux formula, and our bounds for p n , and recalling our bound (8.31) 
for Kmu we see that for |^| , \t\ < 2a n , 

( n 1 

W(£)W{t)\K Mn (S,t)\<mm{—,- 



Moreover for Freud weights, uniformly in j, n 

Xj n ^7+1 ,rt i— ^ 

n 
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Hence 



^ 2 /*oo 2 

< C — Xj+i tn ) min l^jn ~ ^|| mm |~ 'A u ~ *l| du < C\ 



Thus we arrive at the estimate 



Or, 



^ — (x in ) < c / |sw| p (t)ctt, 



valid for n > 1 and polynomials P of degree < n. We note that this does not imply the estimate 



since 



ra /"OO 

^A jn Ty- 2 (x jn )|SW| p (x in ) < C / |5W|P(t)dt 

-• 1 J— oo 



n 



-1/2 

,_ n -2/3) ^ 

n 



when Xj n is close to =ta n . Thus this method is in some respects not as powerful as Nevai's method. 
However, it can easily yield estimates like 



£ -TP(\SW\v(x jn )) < C / (t))dt 



i=i 

where ^ is an increasing convex function that is positive in (0, oo) and has tjj(0) = 0, and works for 
any set of suitably spaced points. 

12.3 The Duality Method 

The idea here is to use duality of L p spaces to derive a forward quadrature sum estimate from a 
converse one. It was probably first used by Konig [65J, [66J. Let us illustrate this in the context 
of quadrature sums at zeros of orthogonal polynomials. Let n be fixed and fi n be a pure jump 
measure having mass Xj n W (xj n ) at Xj n . Let, as usual, q = If P is a polynomial of degree 

< n, 

n \ l l p 1/ 

J2\ 3 nW-\x jn )\PW\ p (x, n )J = Q \PW\ p d^ P = sup y PW 2 g dfi n , 

where the sup is taken over all g with 

r \gW\ q dn n < 1. 

Since g needs to be defined only at {xj n }™ =l , we can assume that g is a polynomial of degree 

< n — 1. So 

/ n poo 
pw 2 g d» n = J2^n(Pg)(x jn ) = / Pgw 2 < \\pw\\ LpW \\ a w\\ Lqm . 
j=l J~oo 
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Now we assume a suitable converse quadrature sum inequality: for n > 1 and all S of degree < n, 

\\SW\\ Lqm < C \ x jnW~ 2 (x jn ) | SW\ q (x jn ) J . (12.4) 
Applying this to g, gives 

/ n \ 1/q l/ q 

\\9W\\ Lqm < C ( f^X jn W- 2 (x jn )\gW\^x jn )\ = C (| \gW\«d^ * < C. 
Thus we obtain 

^ A jri ^- 2 (x in )|SWr(x in )j < C||PW|| Lp(R) . 

In Konig's work, he first proved a converse inequality like (12.4) for 1 < q < 4, and then deduced a 
forward estimate for | < p < oo. Note that one can also use Carleson measures to derive forward 
quadrature estimates, just as we derived Markov-Bernstein inequalities in Section 8.5. For this 
approach on finite intervals, see the survey |95j . 

12.4 The Duality Method for Converse Quadrature Sum estimates 

This was perhaps the earliest method used for converse quadrature sum estimates. For trigono- 
metric polynomials, it appears in Zygmund's treatise |193[ Ch. X, pp. 28-29]. Let 1 < p < oo and 
q = -^j. Let P be a polynomial of degree < n — 1. By duality, 

\PW\\ Lpm = / gPW 2 , 

9 " 



where the sup is taken over all g with HffW^H^ qr) < 1. Let S n [g] denote the nth partial sum of the 
orthonormal expansion of g with respect to {pj}°^ - By orthogonality, we continue this as 



\PW\\ Lp0t) =snp f S n [g] PW 2 . 

Q J 



9 

Using the Gauss quadrature formula, and then Holder's inequality, we continue this as 



SUp XjnSn [g] (x jn )P(x jr 



9 i=i 



< sup\Y, X jn W~ 2 (x jn )\S n [g] W\ q (x jn ) \ I Yl X jn W- 2 (x jn )\PW\ p (x jn ) 



-: supTi x T2. 
9 
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If we have a suitable forward quadrature sum estimate, and mean boundedness of S n in a weighted 
L q norm, we can bound T\ as follows: 

Ti < C \\S n [g] W\\ Lq(R) < C \\gW\\ Lqm < C. 

Then for n > 1 and all polynomials P of degree < n, 

i/p 

J2^nW- 2 ( Xjn )\PW\ P (x jn ) 
3=1 



\pw\\ Lpm <c( 



This method does yield estimates for full Gauss quadrature sums. However, it requires mean 
boundedness of orthogonal expansions, such as that in Theorem 10.3, which is difficult to prove. It 
also requires a forward quadrature sum estimate, which is not all that difficult to prove. It works 
best for p > 4. For p < 4, the next method gives better results. Here is a sample of what one can 
achieve [91] Thm. 1.2, p. 531]. We note that because we are considering a special case of results 
there, the hypotheses there are simplified below: 

Theorem 12.2 Let W £ J-* , and assume that the orthonormal polynomials {p n } for W 2 satisfy 
the bound (10.7). Let p > 4 and r,R£R satisfy 

R>-- (12.5) 
p 

and 

r-min-Ul-A} kfi_^ f 0(1), R + 1 - h 

- ' U H Often)"*), ■ 



Then for n > 1 and polynomials P of degree < n — 1, 

||(PW0(x)(l + |x|)lr (R)<C< 



Vp 

^A in ^- 2 (x,„)|P^r(x jn )(l + |x in |) fip [> . (12.7) 



For p = 4, this persists provided (12.5) holds and 

r — min < i?, 1 1 < 0. 

I P) 

Weights that satisfy the bound (10.7) include exp(— \x\ a ), a > 1. In [91J, we showed that 
slightly weaker forms of these conditions on r, R are necessary. 



12.5 Konig's Method 

Konig's method [55], [55] is based on a clever estimate for Hilbert transforms of characteristic 
functions of intervals. It is technically the most difficult of all the methods we present (for forward 
or converse estimates), but is very powerful, and relatively direct. It does not depend on mean 
boundedness of orthogonal expansions. For n > 1 and 1 < j < n, let 
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Let P be a polynomial of degree < n — 1, and 



-1/2 



'■— a n i 



Pn( x j 



Then 



P(x) = L n [P](x)=a 1 r / 2 p n (x)J2 



f=1 x x i« 



Xjn \Ijn\ 



H[ Xjn ] (x)\+a 1 r / 2 p n (x)H 



it, 

2^ \ T. |*> 

3=1 IVI 



(s) 



= : Ji(x) + J 2 (x). 
Here (as above), 



H[f](x)= lim / 



|x-i|>e 



f(t) 
t — X 



dt 



is the Hilbert transform of /. The term J 2 is easier, so let us deal with it first. We use the bound 



\p n W\{x)<Ca- 1 ' 2 



1 



(In 



valid when W £ T* , and the Hilbert transform bound [144] 

-1/4 



\H[g] (x) 



1 



' Jp (\x\<2a n ) ^ c \\a{x) 



-1/4 



l 



\Lp{\x\<2a n )i 



valid for 1 < p < 4. Here C ^ C (n,g). Then 

II^W|| Xp[ _ 2an)2an] = aV 2 |KWi? 

< C\\ 1 



Z> I J . \ X 3 n 



J 

1/4 n 



17^7 Xj'n(^) I Up [-2a n ,2a„] 



=1 rjn 



n / \ p 

Vjn X 



-p/4 



1/p 



< c< 



VP 



Here 



il>n{x) = m ax < n ' , 1 



-2/3 - l X l 
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It can be shown that uniformly in j, n, 



\Ijn\ 

Hence 



\PW\ (x jn )^/ 4 (x in ); a l J 2 \p' n W\ (x jn ) ~ l^p 1 ^- 1/4 (x jn ); \I jn \ ~ X jn W- 2 (x 



jnj- 



\J2W\\ 



Lp [ '2a, n ,2ctn.] 



< c< 



^A jri ^- 2 (x in )|PW|f( 
The estimation of J\ is difficult. One shows that for \x — Xj n \ > 2 \ Ij n \ , 



i/p 



X Xj n \Ijn\ 



-H [Xjn] (x) 



aH 2 \p n W\{x) < C^/\x) 

= : Cfj n (x), 



iAj tJj J Yl I I I it/ 



1 1 

r + 



l + \X 



and for \x — Xj n \ < 2 \Ij n \ , 
1 1 



3> Xj n I 



# [Xjn] (x) 



jn\ 



ai/ 2 \ Pn W\(x)<C^Pfl=:Cf jn (x) 

\ I jn\ 



so that 



\Ji(x)\<C^2\y jn \f jn (x). 
i=i 



As each f~ n does not change much in one obtains 



\\J X W\\ 



Lp \Xnn )^ln] 



< C \ T kn\ 



k=2 



^ ] Iz/jnl fjnixkn) 
3=1 



P\ 1/P 



One now splits this sum into three pieces. The main part is estimated by showing that certain n 
by n matrices are uniformly bounded in norm. These in turn depend on clever consequences of 
Holder's inequality. After a lot of technical work, one obtains 



i/p 



\\JiW\\ LpiR) <C ^J2x jn W-\x jn )\PW\P(x jn ) 

Here is a sample of what can be proved |914 Thm. 1.2, p. 531]: 

Theorem 12.3 Let W £ T* and 1 < p < 4. Let 

1 1 

r < 1 ; r < R; R> --. 

p p 

Then for n > 1 and polynomials P of degree < n, 



\\(PW)(x) (1 + \x\Y\\ Lpm <C\Y^ \jnW- 2 (x jn )\PW\v(x jn )(l + \x jn \) R P 

3=1 



(12.8) 



1/p 



(12.9) 



HereC ^C(n,P). 
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There it was also shown that the first two conditions on r, R are necessary. For p > 4, the 
author and Mastroianni used Konig's method to prove [1041 Thm. 1.2, p. 149]. 

Theorem 12.4 Let W E T* and p > 4. Let (12.8) hold, and in addition, assume that for some 
5>0, 

n*Vv)a n 1 p > =0(n~ S ). (12.10) 
Then (12.9) holds for n > 1 and polynomials P of degree < n. 

For further work on quadrature sum estimates in the context of exponential weights, see |18j . 
[23] . [96J. In particular, in the latter paper, an attempt was made to provide a very general 
framework for Konig's method. 



13 Hermite and Hermite— Fejer Interpolation 

The Hermite interpolation polynomial to a function / at the zeros {xj n }™ =1 of p n (x) is a polynomial 
H n [/] of degree < 2n — 1 such that for 1 < j < n, 

Hn [f] (Xjn) = fipjn)i 
H' n [f}(xj n ) = f'(x jn ). 

Of course, we have to assume /' is defined at the zeros {xj n }. To avoid this restriction, the Hermite- 
Fejer interpolant H* [/] is often used instead. It is a polynomial of degree < 2n — 1 such that for 
1 < j < n, 

H nif]( x jn) = f(Xjn), 

K'ifK^n) = o. 

Hermite-Fejer polynomials first came to prominence when Fejer used them (with interpolation at 
zeros of Chebyshev polynomials) to provide another proof of Weierstrass' approximation theorem. 
A close relative of the Hermite-Fejer interpolation polynomial is the Griinwald operator 

n 

Yn[f] (x) = ^2f(x jn )^ jn (x), 

3=1 

where {£j n } are the fundamental polynomials of Lagrange interpolation taken at the zeros of p n (x) = 
Pn (VF 2 ,x). Y n has the advantage of being a positive operator, which makes convergence more 
general, though possibly at a slower rate than interpolatory operators. 

Convergence of Hermite and Hermite-Fejer polynomials associated with exponential weights has 
been investigated in [20], [22], pS], [55], [58], p], [60], [62], [63], [M], [89], [US], [110], p32], pO] . 
In some cases, processes involving higher order derivatives have been studied, and the derivatives 
of the interpolating polynomials have also been investigated. Here we note just one result of Szabo 
[180]: 

Theorem 13.1 Let W = exp(-Q) G T* , with A as in (4.18). Let 

W 2 {x) = W 2 {x) (1 + |Q'(x)|) ad-i/^) log (2+ |Q'(a;)|) . 
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Then for every continuous f : R — > R wit-h 



lim /W 2 (x) = 0, 



we Lave 



n- 



lim ||W 2 (/-^)|| 



= 0. 



Note the square of the weight in the condition on / and in the approximating norm. Szabo 
showed that the factor 



is essential and best possible. 
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15 Summary of Notation 

In this section, we repeat the main notation introduced earlier in this paper, and repeat some 
definitions. Throughout, C, C±, C2, . . . denote positive constants independent of n, x, t and polyno- 
mials P of degree < n. The same symbol does not necessarily denote the same constant in different 
occurrences. We write C = C(a) or C ^ C(a) to respectively show that C depends on a, or does 
not depend on a. 

We use the notation ~ in the following sense: given sequences of real numbers {c n } and {d n }, 
we write 



If a; is a real number, then [x\ denotes the greatest integer < x. 

Throughout, W = exp(— Q) is an even exponential weight. Associated with W are the Mhaskar- 
Rakhmanov-Saff numbers a n = a n (Q), the positive root of the equation 



If xQ'(x) is strictly increasing, with limits and 00 at and 00 respectively, then a n is uniquely 
defined. Corresponding to the weight W, we define orthonormal polynomials 



(1 + iQ'(x)l)^ 1 ^) log (2+ \Q'(x 



■)|) 



if for some positive constants C\, C2 independent of n, we have 



Ci < c n /d n < C 2 . 




Pn(x) = Pn (W 2 ,x) = 7„X" + ■ ■ ■ 



7n > 0, 



satisfying 




'mn. 
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Note that the weight is W 2 , not W. We denote the zeros of p n by 

•Erin -^n— l,n ^ ' ' ' ^ 2-2n ^ -^ln- 

The nth Christoffel function for W 2 is 



A n (^ 2 ,x) = inf J '°°\ . (15.1) 

deg(P)<n-l P 2 (x) 

It also satisfies 

71-1 

X n (W 2 ,x) = l/ ' J> 2 (*)- 

j=0 

The reproducing kernel of order m is 

is , .s / n m 7m-lPm(l)Pm-l(i) -Pm-l(s)Pm(t) 
K m (x,t) := 2^Pj(x)Pj(t) = — — — : • 

Our four main classes of weights follow: 

Definition 15.1 (Freud Weights x - ): Let W = exp(— Q), where Q : R —> R is even, Q' exists 
and is positive in (0, oo). Moreover, assume that xQ'(x) is strictly increasing, with right limit at 
0, and for some A, A, B > 1, C > 0, 

A<^4<B, *>C. 

Then we write W € J 7 . 

Definition 15.2 (Freud Weights JF*): Let W = exp(— Q), where Q" exists and is positive in 
(0, oo), while Q' is positive there, with limit at 0, and for some A, B > 1, 

A-l<^%^<B-l, x€(0,oo). 

Then we write W £ J 7 * . 

The canonical examples of in JF* are 

W Q (s) = exp(— |x| a ), a > 0, 

while x 7 * C JF 

Definition 15.3 (Erdos Weights £): Let W = exp(-Q), where Q : R — > R is even, Q' exists 
and is positive in (0, oo). Assume that xQ'(x) is strictly increasing, with right limit at 0, and the 
function 

r ( x) := 2|M (1M) 

is quasi-increasing in the sense that 

< x < y => T(x) < CT(y), 
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while 

lim T(x) = oo. (15.3) 

x— +00 

Assume, moreover, that for some Ci, C2 and C3 > 0, 

c 2 



ycm <r (Q{v)y* > >r 



Then we write W £ £. 



Definition 15.4 (General Exponential Weights F e ven{C 2 ))'- Let I = (— d, d) where < d < 
00. Let Q : I — > [0, 00) satisfy the following properties: 

(a) Q' is continuous and positive in I and Q(0) = 0; 

(b) Q" exists and is positive in (0, d) ; 
(c) 

lim Q(t) = 00; 

(d) The function 

Tit) = 

is quasi-increasing in (0, d), in the sense that 

< x < y < d =► T(x) < CT(y). 
Moreover, assume for some A > 1, 

T(t)>A>l, t€(0,d). 

(e) There exists C\ > such that 

Then we write W G jF ei , era (C 2 ). If also, there exists c G (0, d) such that 

\Q"(x)\ <r Q'(x) 

then we write W G J r ei , era (C 2 +). 

For fc > 1, the fcth iterated exponential is 

exp fe = exp(exp(- • • expQ)) 
Define the symmetric differences 

A fc /(s) = /(* + £) -/(*-£); A k h f(x) = A h (A k h - 1 f(x)), k > 1, 
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so that 



Aj/(x)=x;(*)(-i)v(x+^-t/i). 



i=0 

Given r > 1 and a Freud weight VF = exp(— Q) with Mhaskar-Rakhmanov-Saff numbers {a n }, we 
define the rth order modulus of continuity as follows: First define the decreasing function of t, 

a(t) := inf |a n : ^ < t} , t > 0. 

The rth order modulus of continuity for W is 

uj r . p {f,W,t)= sup ||W(A)i/) Hl^-^),^)] + inf ||(/ - P)W\\ Lpm _ a(t)tam . 

0<h<t deg(_Pj<r— 1 

The rth order K-functional associated with W is 

K r , p (/, W,f) := inf { ||(/ - g)W\\ Lpm +f\\g^W\\ Lpm } , t>0. 

For Erdos weights W £ £, the modulus of continuity involves the function 



where cr(t) is as above. The rth order modulus is then 

u r , p (f,W,t) = sup \\W(x)(A r hMx) f(x))\\ Lpha(2t)M2t)] 

0<h<t 



The error in weighted approximation of / by polynomials of degree < n in the L p norm is 

E n [f;W] p := inf \\(f - P)W\\ Lp(R) . 

acg(P)<n 

If / is such that fW 2 G we define its orthonormal expansion 

oo 
3=0 

where 

/oo 
fPjW 2 , j > 0. 
-oo 

The nth partial sum of the orthonormal expansion is 

n-l 

Sn[f] =J2c jPj , 
3=0 
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while the nth de la Vallee Poussin operator is 

- 

j=n+l 

Given a function /, we define the nth Lagrange interpolation polynomial to / at {xj n }™ =1 to 
be the unique polynomial of degree < n — 1 that satisfies 

L n [f] (Xjn) = f(Xj n ), 1 < j < n. 

One formula for L n [/] is 

n 

L n [/] = J2f(x jn )e jn 

3=1 

where {^n}J =1 are fundamental polynomials of Lagrange interpolation, satisfying 

£jn{xkn) = $jk- 

The {£j n } admit the representation 

n 
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